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Prove the worth of basic research 


European agencies are backing fundamental science and working to prove that it pays off. Other 
national and international bodies should follow suit. 


gold dust for scientists, and help them to push back against 

political demands for applied work. Who doesn't know by now 
that we have basic research to thank for the World Wide Web? Who 
hasn't heard that curious researchers trying to work out how bacteria 
biochemically tick stumbled on the CRISPR-Cas9 gene-editing 
techniques that have gone on to transform biotechnology? 

Still, political support for a thriving fundamental research base 
cannot be taken for granted. So two unexpected — and quite differ- 
ent — moves announced this month are worth noting and celebrating. 

On 15 July, the hard-nosed European Investment Bank, which lends 
with favourable terms to European Union member states to support 
EU policy objectives, gave a massive loan to Greece to start up an 
agency for basic research. This not only provides a much needed 
moral boost for Greece, which has had to live for years with the label 
‘credit-unworthy;, but it also sends a crystal-clear message to politi- 
cians around the world on the clear importance of pure science to a 
secure economic future. 

Then, at the biennial European Open Science Forum in Manchester, 
UK, on 26 July, European Research Council (ERC) president Jean- 
Pierre Bourguignon announced that the council will start to monitor 
the outcomes of the research it funds. The ERC, which was founded 
in 2007 and awards sought-after grants that confer immense prestige 
on recipients, aims to systematically build a body of evidence to dem- 
onstrate the value of pure research beyond well-celebrated examples 
such as those mentioned above. 

In the past 18 months, the ERC has quietly carried out a pilot effort 
to evaluate 199 of its first completed projects. It did not take the easy 
option of just looking at bibliometrics. It wisely took the more inform- 
ative but more difficult option of asking experts not to get hung up on 
numbers, but to make judgements based on their expertise. They had 
to grade the scientific success of each project and assess its impact on 
the world outside science. 

The results? The ERC seems to be a resounding success. (Although 
most of the reviewers had worked with the council before and so 
cart be classed as wholly independent.) Almost three-quarters of the 
projects were judged to have generated a scientific breakthrough or 
major scientific advance, and one-quarter had — or might have in 
the future — an impact on the economy, society or policymaking. 
The exercise cost a mere €200,000 (US$220,000), a tiny fraction of 
the ERC budget. 

This is a very small qualitative study that has some flaws (see 
page 477), and the results cannot be extrapolated to the 6,000 or more 
grants, worth €9.8 billion, that the ERC has so far paid out. But the 
evaluation process is itself under constant review and many of its flaws 
should be ironed out in future rounds. 

The results of the pilot will surprise few scientists, given the well- 
honed and widely admired selection procedures of the ERC. But as 


Te happy accidents that come from blue-skies research are 


the years go by, they will add up to a convincing portfolio to present 
to politicians, showing that ERC spending on basic research is not 
wasted — it usually leads to scientific success, which in turn often 
leads to positive outcomes for society. 

This type of retrospective audit is rare. And it is perhaps surprising 
that national research agencies around the world do not do it. The DFG 
in Germany, for example, feels that its own selection processes are reli- 

able enough not to require further proof of this 


“The struggle type — but then, in Germany, basic research 
between is unusually well protected from the vagaries 
politicians and of politics. The time may be ripe for a modest 
fundamental investment like the ERC’s to be more widely 
researchers is applied. 

eternal.” The struggle between politicians and 


fundamental researchers is eternal, and 
understandably so. In democracies, politicians have to demonstrate to 
their electorates every five years or so that they have presided over serial 
successes and have not thrown away taxpayers’ money on self-indulgent 
frippery. The scientific community has to find ways to continually show 
them that it is producing some of the successes. The strong endorsement 
of basic research by the European Investment Bank is a useful card that 
can be widely played to this end. And the ERC’s example is one to follow: 
gather evidence for the worth of evidence-based arguments. = 


Cures for all 


US lawmakers should give drug firms the 
confidence to test cancer therapies in children. 


some comfort in the numerous treatments available to them — 

both through clinical trials and as drugs that are already on the 
market. Children cannot. Because they make up only 1% of US patients 
with cancer, children are a low priority for pharmaceutical companies 
that want to launch an effective drug quickly. The hassle of a paediatric 
clinical trial may not seem worth it until after the drug has proved to 
be safe and effective in adults. This process can take decades, leaving 
children with therapies that are sometimes almost obsolete. 

To access therapies early, parents of these children can turn to com- 
passionate-use programmes, in which companies give experimental 
drugs to people who are in desperate need. In the United States, firms 
that agree to provide medicines in this way will ask the Food and Drug 
Authority for emergency permission, which is almost always granted. 

This system, although helpful for some, is rife with complications. 


A cancer diagnosis is a shock, but adults with the disease can take 
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Patients and their families report difficulties in applying for such 
programmes, and say that they rarely receive responses. Companies that 
withhold a drug — because it is in short supply or not right for a patient 
— can find themselves on the receiving end of critical social-media cam- 
paigns highlighting individual patients. And firms worry that ifa person 
dies or is harmed while taking a drug, it could hurt the drug’s chances of 
being approved. No one knows how many requests parents make and 
how often companies approve them, but anecdotally, firms often deny 
drugs on the grounds that they have not been tested in children. 

Proper clinical trials for childhood cancer drugs are scarce. Designing 
a clinical trial is never simple, but adding children to the picture compli- 
cates the process immensely. Children are not just ‘small adults’ — they 
metabolize drugs in very different ways. It is difficult to predict from 
adult or animal studies whether a chemotherapy drug will be more or 
less toxic in a child, and at what dose. The process of obtaining informed 
consent for children participating in a trial can also be more compli- 
cated. And companies fear that the death ofa child — even if unrelated 
to the treatment — could bring bad publicity for a new drug. 

Recent years have seen attempts to make more drugs available to 
treat children. In the United States, a 2003 law known as the Pediatric 
Research Equity Act (PREA) requires that companies develop a plan 
for how they will test experimental drugs in children, although many 
trials are exempted. A second law, called the Best Pharmaceuticals for 
Children Act, motivates companies to perform paediatric clinical trials 
by granting an extra six months of market exclusivity for the adult drug. 

Overall, these laws have been successful, leading to hundreds of drug 
labels being updated with information for use in children. But legal 
loopholes often prevent children with cancer from accessing new drugs. 
For instance, therapies for conditions that do not affect children — such 
as Alzheimer’s disease — are exempt from the PREA. And exemptions 


intended for such diseases have been broadly applied to cancer. For 
example, therapies that are being trialled in adults with breast cancer 
are exempted because children do not get that cancer, even if the drug 
could treat a childhood cancer in a different organ. 

Also exempted are drugs for ‘orphar diseases that affect fewer than 
200,000 people in the United States. The number of orphan desig- 
nations has skyrocketed in recent years — the improved ability to 

define the molecular basis of an individual's 


“Legal loopholes cancer means that diagnoses have become 
often prevent increasingly subdivided, and the majority 
children with of approved cancer drugs now carry this 
cancer from orphan designation. 

accessing new Legislation is now attempting to close those 
drugs. * loopholes. The Research to Accelerate Cures 


and Equity (RACE) for Children Act, intro- 
duced to the US Congress on 14 July, would require companies to apply 
the PREA to any therapy with a molecular target that is relevant to both 
an adult and a childhood disease. It would also end the exemption for 
orphan diseases. Last July, the European Medicines Agency passed simi- 
lar rules to make it more difficult for companies to avoid testing drugs 
in children. This applies when the disease has a common mechanism 
in adults and children, unless the drug is likely to be unsafe in children. 
With Congress now out of session and focused on the upcoming 
US election, the RACE for Children Act is unlikely to advance before 
next year. But when lawmakers pick it up, they should also address 
problems with compassionate-use programmes — and ensure a trans- 
parent and useful process for people to gain access to unapproved 
drugs. They should also encourage companies to make more drugs 
available through market incentives, and provide increased protec- 
tion should something go wrong. = 


On impact 


Nature and the Nature journals are diversifying 
their presentation of performance indicators. 


etrics are intrinsically reductive and, as such, can be 
Mises Relying on them asa yardstick of performance, 

rather than as a pointer to underlying achievements 
and challenges, usually leads to pathological behaviour. The journal 
impact factor is just such a metric. 

During a talk just over a decade ago, its co-creator, Eugene Garfield, 
compared his invention to nuclear energy. “I expected it to be used 
constructively while recognizing that in the wrong hands it might be 
abused,” he said. “It did not occur to me that ‘impact’ would one day 
become so controversial.” 

As readers of Nature probably know, journal impact factors measure 
the average number of citations, per published article, for papers pub- 
lished over a two-year period. Journals do not calculate their impact 
factor directly — it is calculated and published by Thomson Reuters. 

Publishers have long celebrated strong impact factors. It is, after all, 
one of the measures of their output’s significance — as far as it goes. 

But the impact factor is crude and also misleading. It effectively 
undervalues papers in disciplines that are slow-burning or have 
lower characteristic citation rates. Being an arithmetic mean, it gives 
disproportionate significance to a few very highly cited papers, and 
it falsely implies that papers with only a few citations are relatively 
unimportant. 

These shortcomings are well known, but that has not prevented 
scientists, funders and universities from overly relying on impact 
factors, or publishers (Nature’s included, in the past) from excessively 
promoting them. As a result, researchers use the impact factor to help 
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them decide which journals to submit to — to an extent that is under- 
mining good science. The resulting pressures and disappointments 
are nothing but demoralizing, and in badly run labs can encourage 
sloppy research that, for example, fails to test assumptions thoroughly 
or to take all the data into account before submitting big claims. 

The most pernicious aspect of this culture, as Nature has pointed 
out in the past, has been a practice of using journal impact factors as 
a basis for assessment of individual researchers’ achievements. For 
example, when compiling a shortlist from several hundred job appli- 
cants, how easy it is to rule out anyone without a high-impact-factor 
journal in their CV. 

How to militate against such a metrics-obsessed culture? 

First, an approach that some have applied in the past and whose 
time has surely come. Applicants for any job, promotion or funding 
should be asked to include a short summary of what they consider 
their achievements to be, rather than just to list their publications. This 
may sound simplistic, but some who have tried it find that it properly 
focuses attention on the candidate rather than on journals. 

Second, journals need to be more diverse in how they display their 
performance. Accordingly, Nature has updated its online journal 
metrics page to include an array of additional bibliometric data. 

Asa part of this update, for Nature, the Nature journals and Scientific 
Reports, we have calculated the two-year median — the median number 
of citations that articles published in 2013 and 2014 received in 2015. 
The median is not subject to distortion by outliers. (The two-year 
median is lower than the two-year impact factor: 24, down from 38, 
for Nature, for example.) For details, see go.nature.com/2arq7om. 

Providing these extra metrics will not address the problem 
mentioned above of the diversity in citation characteristics between 
disciplines. Nor will it make much of a dent in impact-factor 
obsessions. But we hope that it will at least provide a better means of 
assessing our output, and put the impact factor in a better perspective. 

However, whether you are assessing journals or researchers, nothing 
beats reading the papers and forming your own opinion. = 
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BRITISH COUNCIL 


WORLD VIEW .cnsiscorsen 


to establish CERN, Europe’ particle-physics lab, in May 1952. 

At the time, and in the earlier years when the project was being 
planned and launched, Britain’s official position was that European 
cooperation offered little benefit to British physics, and the country 
preferred to go it alone. 

Sound familiar? One month on from the 23 June vote for the United 
Kingdom to leave the European Union, the shock ofa result that few 
in the research world expected shows no sign of easing. 

The prevailing climate is uncertainty. Universities report panic 
among their staff. Despite reassurances from both the British govern- 
ment and the European Commission, the mere prospect of Brexit has 
already had a chilling effect on existing and future EU collaborations. 
One senior manager at a leading university told 
me how a long-standing professor from an EU 
country had asked about the risk of deportation. 

We must not despair: in some respects, we 
have been here before. British centre-right gov- 
ernments have a tendency to view international 
cooperation as less important than competitive- 
ness. Not only did one reject CERN, but a later 
Conservative government, led by Edward Heath, 
opposed the creation of the UN Environment 
Programme on the grounds that Britain already 
had the world’s finest environment policies and 
that further action could harm industry. And, 
although it was a founding member of the UN 
Educational, Scientific and Cultural Organiza- 
tion, the United Kingdom pulled out in 1985 
under the government of Margaret Thatcher. (It 
later rejoined.) 

Throughout those years, Britain’s scientists were no less outward- 
looking, no less collegial and no less innovative than ever. They just 
had to fight hard to retain their international links, as their successors 
and future generations will have to do. 

This will take preparation and political engagement. Despite uncer- 
tainty, science must assume that the decision will be carried out. “Brexit 
means Brexit’, Britain’s incoming Prime Minister Theresa May has said 
repeatedly. And she has put key members of the campaign for Britain to 
leave the EU in ministerial positions where they will be negotiating exit. 

Some in Britain’s scientific community believe that there could be an 
exemption so that EU colleagues can continue to travel, live and work 
freely in the United Kingdom, but that is unlikely. The Brexit cam- 
paigners won on the basis ofa promise that net migration to the United 
Kingdom, currently around 1 million every 3 years, will be cut to tens 
of thousands annually. That means that UK universities and scientists 
must prepare for movement within Europe to become more difficult. 

Scientists are correct to warn about the loss of EU partnerships — 
one pro-vice-chancellor for research said that the greatest loss would 


ik was no British delegate at the first meeting of the council 


DESPITE 


UNCERTAINTY, 


SCIENCE MUST 
ASSUME THAT 


BREXIT 


WILL BE 


CARRIED OUT. 


Resilient British science 
will withstand Brexit 


UK scientists have had to fight to stay international before, and they must not 
stop now, says Ehsan Masood. 


be relationships, rather than money — but that will cut little ice with 
the government’s Brexit-negotiating team. The entire argument to 
leave was built on the idea that the United Kingdom is too involved 
with Europe, and must therefore cut ties. In Parliament on 19 July, one 
of the leaders of the Brexit campaign, Michael Gove, said that British 
universities need not fear Brexit: they did not become among the best 
in the world by being part of any bureaucratic system. 

It is crucial that science has a voice in the Brexit negotiations. The 
community must work through those of its researchers already in 
responsible roles in government departments, who can communicate 
with the new political leadership. If they don't, then the fringe voices 
of climate-change deniers and others still popular with many on the 
extremes of the Conservative Party will gain influence. 

The UK Royal Society’s approach must also 
evolve. Over the past two decades, the post of 
the society’s foreign secretary has been a plum 
appointment. From a distance, it looked a bit 
like a colonial-era governorship, with nothing 
more onerous expected than to travel the world 
giving nice speeches and doing good. Now, the 
post's holder needs to start doing more to rep- 
resent the interests of UK science at home and 
abroad, ready for when Brexit negotiations begin. 
The CERN example is pertinent: despite official 
opposition, observers from the Royal Society 
attended subsequent council meetings. This 
helped to smooth admission when the national 
mood changed. 

Although UK researchers fear the loss of EU 
funds, including grants from the Horizon 2020 
research-funding programme, ministers will 
probably agree on some kind of compensation: perhaps a global fund 
made open to scientists around the world, with grants for excellence 
and a spread of international partners. Big philanthropic funders, 
notably the Wellcome Trust, could also step in with funds, as the trust 
did with its contribution of a few hundred million pounds to the Joint 
Infrastructure Fund during the late 1990s, a previous time of austerity. 

As important as it is to worry about loss of funds and free move- 
ment, the scientific community must take a more fundamental 
concern from the Brexit decision. The vote has exposed fears and 
divisions in British society, and a sizeable section of the political class 
exploited them. Scientists, universities and funders must do more to 
understand what caused these divisions, and at the same time help in 
the urgent task that is to heal them. m 


Ehsan Masood is editor of Research Fortnight. His latest book is The 
Great Invention: The Story of GDP and the Making and Unmaking 
of the Modern World. 

e-mail: ehsan.masood@researchresearch.com 
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Selections from the 
scientific literature 


RESEARCH HIGHLIGHTS 


Early life liked 
it hot 


The last universal common 
ancestor of all life — a microbe 
dubbed LUCA that existed 
around 3.5 billion years 

ago — probably resided ina 
hydrothermal vent that had 
low oxygen levels. 

To find out how the 
organism lived, William 
Martin and his colleagues 
at Heinrich Heine 
University Diisseldorf in 
Germany reconstructed the 
evolutionary trees of more 
than 6 million genes from 
bacteria and archaea. They 
identified 355 protein families 
that were probably in LUCA’s 
genome — these are involved 
in anaerobic metabolism 
and fixing carbon dioxide 
and nitrogen. This suggests 
that LUCA lived in an 
environment that was rich in 
hydrogen, CO, and iron, such 
as a hydrothermal vent. 

LUCA may have depended 
heavily on the geochemistry of 
the vent to survive. 

Nature Microbiol. http://doi.org/ 
bm2s (2016) 


CHEMISTRY 


‘Molecules’ made 
from superatoms 


Chemists have built structures 
similar to molecules but made 
up of superatoms — clusters of 
atoms with some of the same 
properties as atoms. 

To make superatom 
‘molecules, Xavier Roy and 
his colleagues at Columbia 
University in New York City 
created cobalt-selenium 
clusters. They then attached 
two or three clusters to 
one another with ‘arms 
comprising various elements. 

The team hopes that 
the technique can be used 
to make materials with 


BEHAVIOURAL ECOLOGY 


Bird helps people to find honey 


A bird species responds to the specialized calls 
of human honey hunters, then leads them to 


bees’ nests. 


The greater honeyguide (Indicator indicator, 
pictured with honey hunter) benefits by eating 
the beeswax left behind by hunters after they 
break open bees’ nests to get the honey. Claire 
Spottiswoode at the University of Cambridge, 
UK, and her colleagues found that 75% of nests 
located by honey hunters in Niassa National 
Reserve, Mozambique, were found with 


tailored properties such 

as heat conduction or the 
ability to store information 
magnetically. 

Nano Lett. http://doi.org/bmw5 
(2016) 


Revived telescope 
finds 104 planets 


Astronomers have spied 
104 new worlds in the Milky 
Way using NASA’ Kepler 
Space Telescope. 

Part of Kepler broke 
down in 2013, but engineers 
managed to repair it and send 
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the help of the bird. Playing recordings of a 
traditional trill-grunt call made by the local Yao 


people while hunting doubled the chances of 


iton a fresh mission, dubbed 
K2. This latest discovery, 

from Ian Crossfield at the 
University of Arizona in 
Tucson and an international 
team, is the biggest so far for 
the K2 mission. The team 
reports numerous planetary 
candidates, and confirmed 
more than 100 as exoplanets 
using additional observations 
from ground-based telescopes. 
They found that the majority 
of planets are smaller than 
Neptune, and probably have 
thick atmospheres and rocky 
cores. Nearly 40 have a radius 
that is about twice that of 
Earth or smaller, and 4 of those 


attracting a honeyguide to 66%, compared with 
recordings of other human or animal sounds. 
Overall, the call tripled the chances of finding a 
bees’ nest to 54%. 

The ability to understand and respond to 
human calls is not restricted to domesticated 
animals, the authors say. 

Science 353, 387-389 (2016) 


orbit the same red dwarf star. 
Two of the planets orbiting 
the red dwarf could have 
irradiation levels that are 
similar to Earth’s, making life 
on those planets a possibility, 
according to the authors. 
Astrophys. J. Suppl. Ser. (in the 
press); preprint at http://arxiv. 
org/abs/1607.05263 (2016) 


How immune cells 
clear amyloid 


Three key proteins allow 
immune cells in the brain 
to clear out a protein called 


CLAIRE N. SPOTTISWOODE 


DANIEL J COX/GETTY 


DIDIER DESCOUENS/CC BY-SA 4.0 


amyloid-, which is a 
hallmark of Alzheimer’s 
disease. 

The immune cells, called 
microglia, normally absorb 
and digest amyloid-B 
after lipoproteins called 
APOE and CLU attach 
to it. Lino Gonzalez and 
Morgan Sheng of drug firm 
Genentech in South San 
Francisco, California, and 
their colleagues looked for 
interactions between human 
proteins, and found that a 
receptor on microglia called 
TREM2 binds to APOE and 
CLU. Mouse microglia lacking 
Trem2 were less effective 
at absorbing amyloid- 
lipoprotein complexes, 
and digested them more 
slowly than normal cells did. 
Microglia taken from people 
carrying a TREM2 mutation 
were also less able to take up 
the complexes. 

Mutations in the genes 
that encode TREM2, APOE 
and CLU have been linked to 
Alzheimer’s, and so the results 
show how these different 
genetic risk factors could be 
linked. 

Neuron 91, 328-340 (2016) 


PALAEOCLIMATE 


Analysing ancient 
air in salt crystals 


Gas trapped in ancient 
bubbles reveals that Earth's 
atmosphere was rich in oxygen 
up to 200 million years earlier 
than models have predicted, 
well before animal diversity 
exploded. 

Bubbles in salt crystals 
called halites (pictured) 
that formed millions of 
years ago can provide clues 
to ancient climates. Nigel 
Blamey of Brock University 


in St Catharines, Ontario, 
Canada, and his colleagues 
studied pockets of air inside 
815-million-year-old halites 
from southwest Australia. 
The air contained nearly 11% 
oxygen, more than expected 
for that time period. 

The authors suggest that 
high oxygen levels drove 
animal evolution, rather than 
the other way around. 
Geology http://doi.org/bmt3 
(2016) 


OPTICS 


Human eye sees 
single photons 


People can perceive flashes 
of light as feeble as a single 
photon. 

Alipasha Vaziri at the 
Rockefeller University in New 
York City and his colleagues 
asked three volunteers to stare 
into an optical system in the 
dark and listen to two sounds, 
one of which was sometimes 
accompanied by the emission 
of a photon. During more 
than 30,000 trials, the 
participants correctly 
identified a photon more 
frequently than would be 
expected if they had guessed 
at random. 

Going forwards, the team 
plans to test how the human 
visual system responds to 
photons in various quantum 
states. 

Nature Commun. 7,12172 
(2016) 

For more on this story, see 
go.nature.com/Z2anfrrf 


MARINE ECOLOGY 


Ice algae key to 
Arctic food web 


Even creatures living many 
metres below the Arctic 
Ocean's surface rely on algae 
that grow in sea ice and so, like 
those living near the surface, 
may feel the negative effects of 
shrinking ice. 

A team led by Doreen 
Kohlbach of the Alfred 
Wegener Institute Helmholtz 
Centre for Polar and Marine 
Research in Bremerhaven, 
Germany, collected small 


RESEARCH HIGHLIGHTS BiiiSaiiaa¢ 


crustaceans and other 
animals at different depths 
in the central Arctic Ocean 
and analysed their fatty- 

acid content to identify the 
source of carbon in their 
diets. Organisms living at the 
water-ice interface (such as 
Apherusa glacialis; pictured) 
got as much as 92% of their 
carbon from the ice algae. But 
creatures sampled at up to 

50 metres below the surface 
got 14-55% of their carbon 
from the algae. 

Melting sea ice means a 
shrinking habitat for algae, 
which could lead to decreased 
nourishment for the entire 
Arctic food web. 

Limnol. Oceanogr. http://doi.org/ 
bmtq (2016) 


Neurons compete 
to make memories 


Memories that are formed 
within a few hours of one 
another can be encoded bya 
shared set of neurons. 
Sheena Josselyn and Paul 
Frankland at the Hospital 
for Sick Children in Toronto, 
Canada, and their colleagues 
exposed mice to two fear- 
inducing experiences, each 
consisting of a different 
musical tone paired with a 
mild electric shock to the foot. 
When the events occurred 
within 6 hours of each other, 
the associated memories 
were encoded by some of the 
same neurons in the lateral 
amygdala — a brain area 
involved in fear memory. 
However, fear memories 
separated by 24 hours were 


encoded by separate groups of 
neurons. 

The researchers found that 
neurons stimulated by the 
first experience transiently 
suppressed the excitability 
of neighbouring neurons, 
increasing the chances that the 
same cells will respond to a 
second experience that follows 
soon afterwards. 
Science 353, 383-387 (2016) 


Carbon capture 
makes electricity 


A battery developed from 
widely available materials 
produces electricity by 
capturing carbon dioxide from 
gas mixtures. 

Wajdi Al Sadat and 
Lynden Archer at Cornell 
University in Ithaca, New 
York, designed an aluminium- 
based electrochemical cell. At 
the cell’s cathode, oxygen is 
reduced to form a superoxide, 
which then binds CO, and 
combines with aluminium 
from the anode to form 
aluminium oxalate. 

The authors estimate that for 
each kilogram of aluminium, 
more than 9 kilograms of 
CO, can be captured from 
flue gas and transformed to 
generate 3.6 kilowatt-hours 
of electricity. This offers 
a strategy to reduce CO, 
emissions while producing 
power. 

Sci. Adv. 2,e1600968 (2016) 
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SEVEN DA 


Turkey in turmoil 


The clampdown on political 
opposition by Turkish 
President Recep Tayyip 
Erdogan in the wake of the 
failed military coup on 15 July 
spread to the education sector 
last week. The Turkish Council 
of Higher Education (YOK) 
told all 1,577 of the country’s 
university deans to resign their 
posts, and placed a temporary 
international-travel ban on 
Turkish scholars (see Nature 
http://doi.org/bm4d; 2016). 
Erdogan has also closed 

15 universities that he alleges 
are closely tied to the Giilen 
religious movement, which 

he considers to be behind 

the coup. Several academic 
organizations in Europe, 
including the German Rectors’ 
Conference and the European 
Federation of Academies of 
Science and Humanities, have 
condemned the attack on 
academic independence. 


SciDev.Net cuts 
SciDev.Net, a science and 
technology news service for the 
developing world, announced 
on 22 July that it is shedding 
about 80% of its London-based 
staff, including much of the 
editorial team, owing toa 

lack of funding. SciDev.Net 
was launched in 2001 with 
funding from the international 
development agencies of 
Britain, Sweden and Canada, 
and with guidance from 
journals including Nature. It 
said that its decision to scale 
down its London operations 
reflected a commitment to 
preserve its regional services. 


EU collaboration 


University leaders from across 
Europe published a joint 
statement on 22 July calling for 
continent-wide collaboration 
to continue in the wake of 

the United Kingdom's vote 

to leave the European Union 


The news in brief 


ey 


/ “#FUTUREISCLEAN 


tun the world with clean technologies! 


Solar plane achieves historic global flight 


The world’ first fuel-free flight around the 
world, in a solar-powered aeroplane, concluded 
successfully on 26 July, when the Solar 

Impulse 2 craft landed in Abu Dhabi. Pilots 
Bertrand Piccard and André Borschberg flew 
40,000 kilometres in 17 separate flights over 

16 months. Among the records set during the 


last month. The statement, 
signed by representatives from 
24 national and international 
university consortia, calls 

for the exchange of ideas and 
people, as well as the pooling 
of infrastructure, data and 
expertise, to continue as part 
of the United Kingdom's new 
relationship with the rest of 
Europe. 


Digital free speech 
The Electronic Frontier 
Foundation (EFF), a group 

that fights for civil liberties in 
the digital space, has sued the 
US government over a law that 
it says restricts researchers’ 
ability to conduct IT science. 

It says that parts of the Digital 
Millennium Copyright Act that 
restrict access to computer code 
violate the first amendment 
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to the US Constitution, which 
guarantees freedom of speech. 
In particular, the law “stymies 
creators, academics, inventors, 
and researchers’, says the EFF, 
which filed the lawsuit on 

21 July. Among those bringing 
the suit is Matthew Green, 

an IT-security researcher at 
Johns Hopkins University in 
Baltimore, Maryland, who says 
that the law blocks the work of 
scientists such as him who want 
to probe the security of digital 
systems. 


Olive-tree deaths 


The European Commission 
sent Italy a stern warning 
on 22 July, telling it to 

fully implement plans to 
stop the bacterium Xylella 
fastidiosa — currently 
devastating olive groves in 


circumnavigation was Borschberg’s 118-hour 
stretch of solo endurance flying between Nagoya, 
Japan, and Hawaii. Solar Impulse took off from 
Cairo on 24 July for the final leg, and endured 
turbulence over the Saudi desert before touching 
down with Piccard at the controls. The craft has 
17,000 solar cells and a wingspan of 72 metres. 


the Puglia region — from 
spreading to other parts of 
Italy and Europe. In May, 

the commission and the 
Italian government agreed a 
containment plan that includes 
controversial measures such 
as felling infected trees and 
nearby healthy ones, but the 
commission says that the 
plan is not being put fully 

into practice. It argues that 

X. fastidiosa is one of the 
world’s most dangerous plant 
bacteria, and its spread would 
have a huge economic impact. 


| RESEARCH 
Still in the dark 


The most sensitive 
experiment yet designed to 
catch particles of dark matter 
particles has concluded its 


KARIM SAHIB/AFP/GETTY 
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search empty handed. After 
a 20-month run, the Large 
Underground Xenon (LUX) 
experiment in South Dakota 
announced on 21 July that 

it had seen no collisions of 
‘weakly interacting massive 
particles’ (WIMPs) with 

its 370 kilograms of liquid 
xenon. The researchers say 
that WIMPs could still exist, 
but might interact more rarely 
than current experiments 
can detect. Preparations are 
under way to build a much 
larger dark-matter detector, 
called LZ. 


Hottest year 


This year is on track to 

beat 2015 as the warmest 
year since systematic 
measurements began. The 
first 6 months of the year 
were 0.2°C warmer than the 
first half of 2015, the World 
Meteorological Organization 
said on 21 July. The average 
global temperature was 

1.3 °C above pre-industrial 
levels, and 1.05 °C above the 
twentieth-century average, 
according to data sets 
maintained by NASA, the 
US National Oceanic and 
Atmospheric Administration 
and the UK Met Office. A 
strong 2015-16 El Nifio 
warming event in the Pacific 
Ocean contributed to 

the record temperatures, 
which are driven by rising 
atmospheric concentrations 
of greenhouse gases. 


TREND WATCH 


Deaths from HIV/AIDS have 
declined steadily around the 


world over the past decade — but 
the annual rate of new infections 


has stayed much the same, a 
global data analysis published 
on 19 July shows (H. Wang 


et al. Lancet HIV http://doi.org/ 


bm2p; 2016). The number of 
new HIV infections peaked at 


3.3 million in 1997, and dropped 


by an average of about 2.7% 


each year to around 2.5 million 


in 2005. But the infection rate 
has stagnated since then. In 
74 countries, it has increased. 


Clinton picks Kaine 
Hillary Clinton, the 
presumptive Democratic 
nominee for US president, 
named Virginia senator 

Tim Kaine (pictured) 

as her vice-presidential 
running mate on 22 July. 
Kaine previously served as 
governor of Virginia, which 
has been among the key 
battleground states in recent 
elections. He advocated for 
action on climate change as 
governor and had worked on 
education and foreign policy 
in the Senate. 


Agriculture chief 


French President Frangois 
Hollande ignored scientists’ 
furious protests about 

his choice to head the 

French National Institute 

for Agricultural Research 
(INRA) for the next four 
years, and allowed his cabinet 


to confirm the appointment 
of Philippe Mauguin 

on 19 July. Mauguin, a 
former chief of staff to the 
agriculture minister, takes 
over “in a climate of strong 
mistrust’, says the INRA 
branch of the trade union 
CGT. It fears that Mauguin 
will focus on innovation at 
the expense of basic research, 
which would be potentially 
“deadly” for the agency’s 
position as a leader in 
agricultural research. 


Porpoise protection 


Environmental groups 

have welcomed the latest 
attempts by Mexico’s National 
Commission of Aquaculture 
and Fishing (Conapesca) to 
save the vaquita porpoise 
(Phocoena sinus), the world’s 
most endangered marine 
mammal. On 19 July, 
Conapesca announced rules 
banning ¢ill nets’ and night 
fishing in the northern part 
of the Gulf of California, 
where the animals live. Only 
60 vaquita were thought to 
have been alive in the gulf 

in December 2015, down 
from 97 in 2014; gill nets 

are blamed for much of the 
decline. 


Aircraft emissions 
The US Environmental 
Protection Agency (EPA) 
paved the way for new climate 


DECLINE IN NEW HIV INFECTIONS STALLS 
Although deaths from HIV/AIDS have dropped steadily over 
the past ten years, the global annual number of new 


infections has stayed largely the same. 
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27 JULY-10 AUGUST 
Young scientists from 
around the world take 
part in the London 
International Youth 
Science Forum 2016, 
hosted by Imperial 
College London. 
www.liysf.org.uk 


1-12 AUGUST 
Physicists discuss 
LIGO’s gravitational- 
wave detection at the 
Kavli Institute for 
Theoretical Physics 
in Santa Barbara, 
California. 
go.nature.com/2afhjwa 


regulations on 25 July by 
declaring that greenhouse- 
gas emissions from aircraft 
endanger public health and 
the environment. The finding 
comes five months after the 
United Nations’ International 
Civil Aviation Organization 
(ICAO) proposed the first 
greenhouse-gas standards for 
aircraft. Environmentalists 
criticized the ICAO proposal 
as weak and are pressuring 
the EPA to suggest stronger 
regulations. The EPA said 
that its standards would be 
“at least as stringent” as those 
proposed by the ICAO. 


Predator plan 


New Zealand hopes to be 
free of introduced predators 
— which include rats, 

stoats and possums — by 
2050, according to a plan 
announced on 25 July. The 
government will invest 

an initial NZ$28 million 
(US$20 million) and work 
with the private sector 

to rid the country of the 
predators, which it says kill 
25 million native birds each 
year. Conservation minister 
Maggie Barry says that the 
plan will need technology that 
does not yet exist. 
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Hillary Clinton and Donald Trump are battling for the White House. 


_US ELECTION | 


Trump vs Clinton: worlds 
apart on science 


Presidential candidates begin to make clear their stark differences on climate change, 
energy production and stem-cell research. 


BY JEFF TOLLEFSON 


cience is slowly coming into focus in the 
US presidential campaign. Although 


neither Republican Donald Trump nor 
Democrat Hillary Clinton has emphasized 
core research issues, the candidates — and 
their parties — are beginning to flesh out their 
positions on climate change, education, bio- 
medical research and other topics that involve 
the scientific community. 
Trump’s pick of Indiana Governor Mike 
Pence as his running mate on 15 July signalled 


a sharp turn towards the Republican party’s 
conservative base. Pence, a self-described 
Christian conservative, has questioned the 
existence of climate change, waffled on evo- 
lution and criticized President Barack Obama 
for supporting embryonic-stem-cell research. 
His new role aligns with the hard-line policy 
platform adopted at the Republican conven- 
tion, where Trump officially became the party’s 
nominee on 19 July. 

If Trump wins, Pence’s rise could embolden 
conservative Republicans to seek new limits 
on federal funding for embryonic-stem-cell 


research. But predicting how Trump would 
govern is a dangerous parlour game, says 
Michael Werner, executive director of the Alli- 
ance for Regenerative Medicine, an advocacy 
group in Washington DC: “We really don’t know 
what a Trump-Pence administration would do.” 
It’s acommon refrain. Deciphering Trump's 
views on core science issues has been difficult 
given the free-wheeling style of his populist 
campaign. He has often seemed to focus more 
on taunting the political establishment than 
on staking out policy positions. By contrast, 
the Clinton campaign has consulted dozens 
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> of scientists on topics that include health, 
education and the environment. 

“Trump doesn’t have a prominent policy 
shop and a prominent set of policy advisers,” 
says Douglas Holtz-Eakin, who counselled 
Republican senator John McCain (Arizona) on 
economic policy during his failed 2008 presi- 
dential bid. “Clinton has a vast bureaucracy 
and a ten-point plan for going out to lunch, so 
they couldn't be more different.” 

The two candidates — whose campaign 
staff declined multiple interview requests 
— also seem to think very differently about 
the role of science. Although Clinton has 
described science and innovation as a foun- 
dation for the future, science funding seems 
to be an afterthought for Trump, says John 
Karsten, coordinator of the Center for Tech- 
nology Innovation at the Brookings Institu- 
tion, a think tank in Washington DC. Instead, 
the Republican has focused on issues such as 
national security, immigration and crumbling 
infrastructure. 

Climate change is one of the few science 
topics that has grabbed the campaign spotlight 
— in part because of Republican anger over 
Obama's regulations to limit greenhouse-gas 
emissions from power plants, vehicles and oil 
and gas development. Clinton's climate and 
energy proposals would largely maintain the 
current course; by contrast, in a major policy 
speech on 26 May, Trump promised to roll 
back Obama's “totalitarian” regulations and 
withdraw the United States from the Paris 


climate agreement. Trump, who has long 
denied mainstream climate science, also said 
that his administration will focus on “real envi- 
ronmental challenges, not phony ones’. 


SPLIT TICKETS 
This yawning philosophical divide is apparent 
in the party platforms that the Republicans and 
Democrats developed ahead of their nominat- 
ing conventions this month. Environmental- 
ists have criticized the Republican platform 
for labelling coal a “clean” energy source, even 
though it produces 


“Climate is more carbon dioxide 
going to be emissions per unit of 
talked about in energy than any other 


fossil fuel. Demo- 
crats, meanwhile, are 
poised to adopt a platform this week at their 
national convention that calls for using “every 
tool available to reduce emissions now”. 

“Climate is going to be talked about in 
this campaign, because the candidates have 
distinctly different positions,’ says Michael 
Oppenheimer, a climate scientist at Princeton 
University in New Jersey who is advising the 
Clinton team. Although his workload was light 
during primary season, Oppenheimer antici- 
pates questions from the campaign about how 
global warming might affect certain regions, or 
the extent to which an extreme weather event 
might be related to global warming. 

Some experts say that the Democratic 
party's adoption of science as a campaign issue 


this campaign.” 


— which Obama kick-started in 2008 — risks 
further polarizing thorny policy debates around 
scientific issues such as global warming. “The 
Democrats found that science was a good thing 
for them, just like historically strong support for 
the military was good for the Republicans,’ says 
Daniel Sarewitz, co-director of Arizona State 
University’s Consortium for Science, Policy 
and Outcomes in Washington DC (anda regu- 
lar contributor to Nature). “Ifthe Democrats are 
the party of science, and you are a Republican, 
what does that make you think?” 

But Holtz-Eakin says that the Trump cam- 
paign’s apparent decision to forgo science 
advice is a reflection of Trump himself, not 
of Republican priorities. In 2008, he notes, 
the McCain campaign consulted scientists to 
formulate its positions on issues such as global 
warming — just as Clinton has done. 

With just over three months until the elec- 
tion, there is still a chance that Trump will 
assemble his own coterie of science advisers, 
says Andrew Rosenberg, who heads the Center 
for Science and Democracy at the Union of 
Concerned Scientists in Cambridge, Mas- 
sachusetts. Doing so not only informs policy 
positions, it builds relationships that are useful 
after the election, when the winning candidate 
begins to assemble a government. 

“These things widen the network,’ Rosen- 
berg says. “I know it’s happening with the 
Clinton campaign, and at some point I would 
expect it would happen with the Trump 
campaign.” = 


ASTRONOMY 


How to hunt for alien life 


Astrobiologists try to determine the chemical signature of life on other worlds. 


BY ALEXANDRA WITZE 


alarms abound. Researchers have gener- 

ally considered, and rejected, claims rang- 
ing from a 1970s report of life on Mars to the 
1990s ‘discovery’ of fossilized space microbes 
in a meteorite. 

Now, inspired by the detection of thousands 
of planets beyond the Solar System, NASA has 
started a fresh effort to learn how to recognize 
extraterrestrial life. The goal is to understand 
what gases alien life might produce — and how 
Earth-bound astronomers might detect such 
‘biosignatures’ in light passing through the 
atmospheres of planets trillions of kilometres 
away (see ‘Searching for alien life’). 

The agency will convene a workshop this 
week in Seattle, Washington, with the ultimate 
goal of advising a NASA exoplanet group on 


E the search for life beyond Earth, false 
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how to avoid embarrassing errors in the future. 
“We have to come together and determine 
what good evidence of life on another planet 
could be,’ says Shawn Domagal-Goldman, one 
of the workshop’s organizers and an astrono- 
mer at NASA's Goddard Space Flight Center in 
Greenbelt, Maryland. 

The exercise comes at a crucial time, as 
astronomers grapple with how to interpret 
exoplanet data from the next generation of 
telescopes. Some scientists are working to 
understand how nature could produce arche- 
typal biosignature gases, such as oxygen, in the 
absence of living organisms. Others are trying 
to think as expansively as possible about the 
types of biochemistry that could sustain life. 

“We could fool ourselves into thinking a 
lifeless planet has life — or we could be miss- 
ing life because we don't really understand the 
context of what could be produced on another 


planet,” says Sarah Rugheimer, an astronomer 
at the University of St Andrews, UK. 

Detecting a biosignature gas is just the first 
step to understanding what could be happening 
onan exoplanet. Each world has its own combi- 
nation of physical and chemical factors that may 
or may not lead to life, says Victoria Meadows, 
an astronomer at the University of Washington 
in Seattle. “Planets are hard, and we shouldn't 
think they are all going to be the same or reveal 
their secrets very easily,” she says. 

A planet’s environment is key. Some Earth- 
sized planets orbit M dwarf stars — the most 
common type of star in the Galaxy — at the 
right distance to harbour liquid water. But 
Meadows’ collaborators have shown! that 
photochemical reactions can send water into 
the planet’s atmosphere and then break off its 
hydrogen, which escapes into space. What's left 
is a thick blanket of oxygen that might seem as 
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SEARCHING FOR ALIEN LIFE 


Astrobiologists are fine-tuning the list of substances that, if spotted on a planet 
orbiting another star, could constitute evidence of extraterrestrial life. 


LIFE AS WE KNOW IT 


One method is to study a star’s light for 
the chemical imprint of gases that may 
have been formed by living organisms. 
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Changes in the starlight transmitted through 
the planet’s atmosphere reveal gases within. 
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if it came from living organisms, but results 
from a runaway greenhouse effect. 

There are ways to tell. The runaway green- 
house would create an atmosphere thousands 
of times denser than Earth’s, in which O, 


LIFE AS WE DON’T 


Another approach is to evaluate a 
huge range of molecules, winnowing 
them down on the basis of factors 
such as stability and detectability. 


All small 
molecules 


aean Earth 


Possible 
biosignature 
gases 


molecules collide to produce O,. So spotting 
O, ina planet’s atmosphere could be a clue that 
the oxygen does not, in fact, come from life, 
Meadows’ team reported this year’. 

Another method is to draw up a list of 


IN FOCUS 


alternative biosignature gases — things not 
as obvious as oxygen that might be made by 
organisms under certain conditions. These 
include dimethyl sulfide’, which is produced 
by Earthly phytoplankton, or even ammonia’. 
Onacold alien planet, organisms might make 
the gas using the same chemical process as 
industrial manufacturers. 

At the Massachusetts Institute of Technol- 
ogy in Cambridge, astronomer Sara Seager has 
begun to examine 14,000 compounds that are 
stable enough to exist in a planetary atmos- 
phere. She and her colleagues are winnowing 
down their initial list of molecules using crite- 
ria suchas whether there are geophysical ways 
to send the compound into the atmosphere’. 

“We're doing a triage process,” says Seager. 
“We don't want to miss anything.” 

The Seattle meeting aims to compile a work- 
ing list of biosignature gases and their chemi- 
cal properties. The information will feed into 
how astronomers analyse data from NASA‘s 
James Webb Space Telescope, slated for launch 
in 2018. The telescope will be able to look at 
only a handful of habitable planets, but it will 
provide the first detailed glimpse of what gases 
surround which world, says Nikole Lewis, an 
astronomer at the Space Telescope Science 
Institute in Baltimore, Maryland. 

No single gas is likely to be a slam-dunk 
indicator of alien life. But Domagal-Goldman 
hopes that the workshop will produce a frame- 
work for understanding where scientists could 
trip themselves up. “We don’t want to have a 
great press release,’ he says, “and then a week 
later have egg on everybody’s faces.” = 
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Zika virus might not be only factor in reported microcephaly surge. 


BY DECLAN BUTLER 


overnment researchers in Brazil are set 
to explore the country’s peculiar dis- 
tribution of Zika-linked microcephaly 

— babies born with abnormally small heads. 
Zika virus has spread throughout Brazil, 
but extremely high rates of microcephaly 
have been reported only in the country’s 


northeast. Although evidence suggests that 
Zika can cause microcephaly, the clustering 
pattern hints that other environmental, socio- 
economic or biological factors could be at play. 

“We suspect that something more than Zika 
virus is causing the high intensity and sever- 
ity of cases,’ says Fatima Marinho, director 
of information and health analysis at Brazil's 
ministry of health. If that turns out to be true, 


it could change researchers’ assessment of the 
risk that Zika poses to pregnant women and 
their children. 

The idea has long been on Brazilian research- 
ers’ radar, but the enquiry marks the first time 
that scientists at the health ministry have taken 
up the hypothesis. The ministry has asked 
Oliver Brady, an epidemiologist at the London 
School of Hygiene & Tropical Medicine, 
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> and Simon Hay, director of 
geospatial science at the Institute 
for Health Metrics and Evaluation 
in Seattle, Washington, to collab- 
orate with researchers in Brazil. 
“The aim is to understand why we 
are only observing elevated rates 
in the northeast,’ says Brady, who 
flew into Brasilia this month to 
begin work. 

The northeast was where the 
first reported surge in micro- 
cephaly cases in Brazil began a 
year ago. Health officials had 
expected that they would later see 
the same high rates in other parts 
of the country. “We were expect- 
ing an explosion of birth defects,” 
says Marinho. 

But as of 20 July, almost 90% 
of the 1,709 confirmed cases 
of congenital microcephaly or 
birth defects of the central nervous system 
reported in Brazil since last November were 
ina relatively small area: in the coastal hin- 
terland of the country’s northeastern tip. 
Particularly surprising, says Marinho, is 
that just three cases have been confirmed in 
Brazil's second-most populous state, Minas 
Gerais, which borders the most-affected part 
of the northeast region. Poor data on the scale 
and timing of Zika outbreaks across Brazil 
make it hard to tell whether increases in micro- 
cephaly elsewhere might have been delayed 
— but ministry scientists now think that the 
northeast represents a marked outlier, she says. 

There are many hypotheses about what 
might be going on. Marinho says that her 
team’s data, submitted for publication, hint that 
socio-economic factors might be involved. For 


A health worker sprays insecticide to combat the mosquito that spreads Zika. 


example, the majority of women who have had 
babies with microcephaly have been young, 
single, black, poor and tend to live in small 
cities or on the outskirts of big ones, she says. 

Another idea is that co-infections of Zika 
and other viruses, such as dengue and chikun- 
gunya, might be interacting to cause the high 
intensity of birth defects in the area. 

Ina paper published last month, research- 
ers from Brazilian labs noted a correlation 
between low vaccination rates for yellow fever 
and the microcephaly clusters (L. P. de Goes 
Cavalcanti et al. J. Infect. Dev. Countries 10, 
563-566; 2016). Because yellow fever and 
Zika are in the same virus family, the scientists 
speculate that the vaccine might provide some 
protection against Zika. 

And the Brazilian doctor who was the first 


to report a firm link between Zika 
and microcephaly — Adriana 
Melo at IPESQ, a research insti- 
tute in Campina Grande — has 
another idea. In a preprint 
posted on the bioRxiv server 
on 15 July, Melo and her col- 
leagues at the Federal Univer- 
sity of Rio de Janeiro reported 
finding bovine viral diarrhoea 
virus (BVDV) proteins in the 
brains of three fetuses with 
microcephaly (F. C. S. Nogueira 
et al. Preprint at bioRxiv http:// 
doi.org/bm4c; 2016). 

BVDV causes birth defects in 
cattle but is not known to infect 
people. Melo and her team sug- 
gest that Zika infection might 
make it easier for BVDV to cause 
infections; however, they haven't 
ruled out the possibility that their 
findings might be due to contamination. 

The Brazilian health ministry’s study will 
test for BVDV among other ideas, says Brady. 
Researchers will reanalyse raw data on micro- 
cephaly cases, and will model connections with 
possible cofactors such as socio-economic sta- 
tus, water contamination and mosquito-borne 
diseases. Most of this information comes from 
health-ministry databases, but the team will 
also study experimental data, such as how peo- 
ple’s immune response may change after past 
infection with other viruses such as dengue. 

Until more is known about Zika and the 
causes of increased microcephaly rates in 
Brazil's northeast, public-health actions and 
advice must err on the side of precaution, says 
Ian Lipkin, a virologist and outbreak special- 
ist at Columbia University in New York City. m 
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First trial of CRISPR in people 


Chinese team approved to test gene-edited cells in people with lung cancer. 


BY DAVID CYRANOSKI 


hinese scientists are on the verge 
( of being first in the world to inject 

people with cells modified using the 
CRISPR-Cas9 gene-editing technique. 

A team led by Lu You, an oncologist at 
Sichuan University’s West China Hospital in 
Chengdu, received ethical approval to test the 
cells in people with lung cancer on 6 July, and 
plans to start the trial next month. 

That timeline puts the proposal ahead 
of a planned US trial to test CRISPR-Cas9- 
modified cells, also for the treatment of cancer. 
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“It’s an exciting step forward,’ says Carl June, 
a clinical researcher in immunotherapy at the 
University of Pennsylvania in Philadelphia. 

Last month, the US trial was approved by 
an advisory panel of the US National Insti- 
tutes of Health (NIH) but had yet to receive 
a green light from the US Food and Drug 
Administration (FDA) and a university review 
board. There have also been a number of 
human clinical trials using an alternative gene- 
editing technique, including one led by June, 
that have helped patients to combat HIV — but 
none so far has used CRISPR. 

The Chinese trial will enrol patients who 


have metastatic non-small cell lung cancer and 
for whom chemotherapy, radiation therapy 
and other treatments have failed. “This tech- 
nique is of great promise in bringing benefits 
to patients,” says Lu. 


CHROMOSOME SNIP 

Lu’s team will extract immune cells called 
T cells from the participants’ blood, and use 
CRISPR-Cas9 technology — which pairs 
a molecular guide able to identify specific 
genetic sequences on a chromosome with 
an enzyme that can snip the chromosome at 
that spot — to knock outa specific gene in the 
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cells. The target gene encodes a protein called 
PD-1 that normally acts as a check on the cell's 
capacity to launch an immune response. 

The gene-edited cells will then be multiplied 
in the lab and re-introduced into the patient’s 
bloodstream, where, the team hopes, they will 
home in on the cancer. The proposed US trial 
similarly involves knocking out the gene for 
PD-1, but also includes knocking out a second 
gene and inserting a third. 

Last year, the FDA approved two antibody- 
based therapies that block PD-1 for use against 
lung cancer. Gene editing is expected to inhibit 
PD-1 with greater certainty, and by multiply- 
ing the cells, the scientists can increase the 
chance of triggering an immune response 
against tumours. 

It is well known that CRISPR-Cas9 can 
result in edits at the wrong place in the 
genome, with potentially harmful effects. 
Chengdu MedGenCell, a biotechnology com- 
pany in China and a collaborator on the trial, 
will validate the cells to ensure that the cor- 
rect genes are knocked out before the cells are 
re-introduced into the patients, says oncologist 
Lei Deng of West China Hospital, a member 
of Lu’s team. 


Because the technique targets T cells 
— which are involved in various types of 
immune response — in a non-specific way, 
Chan worries that the approach might induce 
an autoimmune response in which T cells cir- 
culating in the blood might start to attack the 
gut, adrenaline glands or other normal tissue. 

He suggests, instead, that the team take 
T cells from the site of the tumour because 
they would already be specialized for attack- 
ing cancer. But Deng says that the lung-cancer 
tumours targeted by their trial are not eas- 
ily accessible. He also says that the team is 
reassured by the FDA-approved antibody 
therapies, which did not show a high rate of 
autoimmune response. 


SAFETY FIRST 

The phase I trial is designed foremost to 
test whether the approach is safe. It will 
examine the effects of three different dosage 
regimens on ten people, and, Deng says, the 
team plans to proceed slowly by increasing 
the dosage gradually and starting with just 
one patient, who will be monitored closely 
for side effects. But the researchers will also 
closely watch markers in the blood that would 
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indicate that the treatment is working. 

China has had a reputation for moving fast 
— sometimes too fast — with CRISPR, says 
Tetsuya Ishii, a bioethicist at Hokkaido Uni- 
versity in Sapporo, Japan. 

Lu says that his team was able to progress 
so rapidly because they are experienced with 
clinical trials of cancer treatments. The review 
process, which took halfa year, included close 
communication with the hospital’s inter- 
nal review board (IRB). “There was a lot of 
back and forth,” he says. The NIH’s approval 
of the other CRISPR trial “strengthened 
our and our IRB’s confidence in this study’, 
he adds. 

June is not surprised that a Chinese group 
has jumped out in front, he says, because 
“China places a high priority on biomedi- 
cal research” Ishii notes that the clinical trial 
would be the latest in a series of firsts for 
China in the field of CRISPR — including 
the first CRISPR-edited human embryos and 
monkeys. “When it comes to gene editing, 
China goes first,” he says. 

Lu remains cautious. “I hope we are the 
first” he says. “And more importantly, I hope 
we can get positive data from the trial.” m 


EUROPEAN UNION 


Major funder tracks impact 


European Research Council embarks on an unusual evaluation that could inspire others. 


BY ALISON ABBOTT 


ast month, neuroscientist Ileana 
[nen cpat began a risky project 
with a risqué name: Psychocell. With 

a grant of €2 million (US$2.2 million), she 
is studying whether a single type of neuron 
causes a miswiring in the developing brain that 
has been linked to psychiatric disease. But it 
may turn out that no ‘psychocell’ exists, or that 
her mouse models are unsuitable. 

Supporting such blue-skies research is the 
mission of her funder, the prestigious Euro- 
pean Research Council (ERC), which launched 
in 2007 to raise the quality of European science. 
“No one but the ERC would have funded such 
a high-risk project,” says Hanganu-Opatz, 
from the University of Hamburg, Germany. 

Now, the council, which sits within the 
European Union’s Framework funding pro- 
grammes and has a €1.7-billion budget this 
year, has embarked on an unusual exercise: 
to retrospectively evaluate the success of the 
projects it funds. By contrast, most funding 
agencies assume that the evaluation to select 
which projects they fund is sufficient. 

“Virtually no basic research funding 


TO SCIENCE AND BEYOND 


Scientific experts evaluated 199 completed projects funded by the European Research Council — 
including their contribution to science and to wider society. 


Overall grade attributed to projects 


Scientific 
breakthrough 21% 


Major scientific 
advance 50% 


Had impact beyond science that is already apparent 


To a large extent Somewhat 


(o} 
NO 
io} 


agency tries retrospectively to analyse its own 
performance and impacts,’ says Erik Arnold, 
chair of Technopolis, a European research 
and innovation consultancy headquartered 
in Brighton, UK. “It would be nice if the ERC 
effort would inspire others to do so.” 

On 26 July, at the European Science Open 
Forum in Manchester, UK, ERC president Jean- 
Pierre Bourguignon announced the results of 
a pilot investigation of 199 completed projects, 
almost three-quarters of which were deemed 


Incremental scientific 


No appreciable scientific 


contribution 25% contribution 4% 


Not at all Not applicable 


60 100% 


to have resulted in a scientific breakthrough or 
major advance (see “To science and beyond’). 

“We push both scientists and grant- 
application reviewers to take a certain risk, so 
it is important to know that they are actually 
taking risks — and that we are selecting the 
right projects,’ says Bourguignon. 

The ERC now plans to evaluate a selection of 
completed projects each year and to keep refin- 
ing its methodology. Bourguignon hopes that 
this will help the council during discussions > 
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> with politicians. “We want evidence-based 
arguments to show that bottom-up, curi- 
osity-driven research is valuable to society,’ 
he says. The council will have to lobby to keep 
its generous funding in the next Framework 
programme, due to begin in 2021. 

The pilot evaluation rated projects that were 
among the first to be funded by the council, 
mostly in 2007 and 2008. It assigned eight pro- 
jects each to 25 three-person expert groups. 

The ERC gave the experts a bibliometric 
analysis of the publications from each project, 
but asked them to use their professional judge- 
ment to form an overall view of each one. 

They found that 43 had led to a scientific 
breakthrough, 99 had generated a major 
advance — and only 7 had had no appreciable 
scientific output. That indicates an appropriate 
level of risk and ambition, says Bourguignon. 

The evaluators also judged that almost 10% 
of projects had already had a large impact on 
the economy, policymaking or other aspects 
of society, and that around one-quarter were 
likely to do so in the future. 

“Tt’s a delight to see a qualitative approach,” 
says science-policy specialist Ben Martin at the 
University of Sussex in Brighton. “Bibliometrics 
are misleading in isolation — but too often used 
this way.’ Bourguignon says that many of the 
evaluators, who remain anonymous, struggled 
with the unfamiliar task of subjectively declar- 
ing research a “scientific advance”. 

The study has limitations. Two experts in 
each group had served on ERC grant-awarding 
panels. None of the projects that they judged 
was included in the analysis, but the process 
could seem unobjective, says Arnold. Martin 
says the terms used to categorize the projects 
may be interpreted differently across disciplines. 
“As a social scientist, I can tell you that we don't 
describe our work in terms of ‘breakthroughs” 

Bourguignon agrees that the small study 
was not optimally designed. But the ERC has 
since solicited independent comments on the 
methodology, and an ongoing evaluation of a 
further 250 projects has been fine-tuned to let 
evaluators across disciplines report consistently. 

In the pilot review, evaluators also stressed the 
ERC’s impact on an individual’s career, some- 
thing that Hanganu-Opatz experienced at first 
hand. Her university gave her tenure on 18 July, 
and three other universities made her offers. 
“The visibility you get when you win an ERC 
grant is embarrassing,” she says. m SEE EDITORIALP.465 
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COMMUNICATIONS 


China’s 600-kilogram quantum satellite contains a crystal that produces entangled photons. 


ee 


One giant step for 
quantum internet 


Chinese satellite is first in a wave of planned craft that 
could form network secured by quantum cryptography. 


BY ELIZABETH GIBNEY 


hina is poised to launch the world’s 
( first satellite designed to do quantum 

experiments. A fleet of quantum- 
enabled craft is likely to follow. 

First up could be more Chinese satellites, 
which will together create a super-secure 
communications network, potentially link- 
ing people anywhere in the world. But groups 
from Canada, Japan, Italy and Singapore also 
have plans for quantum space experiments. 

“Definitely, I think there will be a race,’ says 
Chaoyang Lu, a physicist at the University of 
Science and Technology of China in Hefei, 
who works with the team behind the Chinese 
satellite. The 600-kilogram craft, the latest in 
a string of Chinese space-science satellites, 


will launch from Jiuquan Satellite Launch ¥ 
Center in August. The Chinese Academy 2 
of Sciences and the Austrian Academy of 
Sciences are collaborators on the US$100- 
million mission. 

Quantum communications are secure 
because any tinkering with them is detect- 
able. Two parties can communicate secretly 
— by sharing a encryption key encoded in 
the polarization of a string of photons, say 
— safe in the knowledge that any eavesdrop- 
ping would leave its mark. 

So far, scientists have managed to demon- 
strate quantum communication up to about 
300 kilometres. Photons travelling through 
optical fibres and the air get scattered or 
absorbed, and amplifying a signal while pre- 
serving a photon’s fragile quantum state is 


CAI YANG/XINHUA VIA ZUM. 


‘) | MORE NEWS | 
Brexit watch: @ Dutch agency launches first grants Timing our 
MORE UK researchers | programme dedicated to replication breathing; 
scramble go.nature.com/2a6qjwh working with 
ONLINE to save @ Parasite infection calls zebrafish indigenous 
science work into question go.nature.com/2a6qatn | | peoples; and yeast 
go.nature. @ The Milky Way’s dark twin go.nature. genetics nature.com/ 
com/2arxwil com/2a6qhib nature/podcast 


478 | NATURE | VOL 535 | 28 JULY 2016 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


— 


extremely difficult. The Chinese researchers 
hope that transmitting photons through space, 
where they travel more smoothly, will allow 
them to communicate over greater distances. 

At the heart of their satellite is a crystal that 
produces pairs of entangled photons, whose 
properties remain entwined however far apart 
they are separated. The craft's first task will be 
to fire the partners in these pairs to ground 
stations in Beijing and Vienna, and use them 
to generate a secret key. 

During the two-year mission, the team also 
plans to perform a statistical measurement 
known asa Bell test to prove that entanglement 
can exist between particles separated by a dis- 
tance of 1,200 kilometres. Although quantum 
theory predicts that entanglement persists at 
any distance, a Bell test would prove it. 

The team will also attempt to ‘teleport’ quan- 
tum states, using an entangled pair of photons 
alongside information transmitted by more 
conventional means to reconstruct the quan- 
tum state of a photon in a new location. 

“If the first satellite goes well, China will 
definitely launch more,” says Lu. About 20 sat- 
ellites would be required to enable secure com- 
munications throughout the world, he adds. 

The teams from outside China are taking 
a different tack. A collaboration between the 
National University of Singapore (NUS) and 
the University of Strathclyde, UK, is using 


cheap 5-kilogram satellites known as cube- 
sats to do quantum experiments. Last year, 
the team launched a cubesat that created and 
measured pairs of ‘correlated’ photons in orbit; 
next year, it hopes to launch a device that pro- 
duces fully entangled pairs. 

Costing just $100,000 each, cubesats make 
space-based quantum communications acces- 
sible, says NUS physicist Alexander Ling, who 
is leading the project. 

A Canadian team proposes to generate 
pairs of entangled photons on the ground, 
and then fire some of them to a microsatel- 
lite that weighs less than 30 kilograms. This 
would be cheaper than generating the photons 
in space, says Brendon Higgins, a physicist at 
the University of Waterloo, who is part of the 
Canadian Quantum Encryption and Science 
Satellite (QEYSSat) team. But delivering the 
photons to the moving satellite would be a 
challenge. The team plans to test the system 
using a photon receiver on an aeroplane first. 

An even simpler approach to quantum space 
science, pioneered by a team at the University 
of Padua in Italy led by Paolo Villoresi, involves 
adding reflectors and other simple equipment 
to regular satellites. Last year, the team showed 
that photons bounced back to Earth off an 
existing satellite maintained their quantum 
states and were received with low enough error 
rates for quantum cryptography (G. Vallone 
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et al. Phys. Rev. Lett. 115, 040502; 2015). 

Researchers have also proposed a quan- 
tum experiment aboard the International 
Space Station (ISS) that would simultaneously 
entangle the states of two separate properties 
ofa photon to make teleportation more reliable 
and efficient. 

As well as making communications much 
more secure, these satellite systems would 
mark a major step towards a ‘quantum inter- 
net’ made up of quantum computers around 
the world, or a quantum computing cloud, 
says Paul Kwiat, a physicist at the University of 
Illinois at Urbana-Champaign who is working 
with NASA on the ISS project. 

Eventually, quantum teleportation in space 
could even allow researchers to combine 
photons from satellites to make a distributed 
telescope with an effective aperture the size of 
Earth — and enormous resolution. “You could 
not just see planets,” says Kwiat, “but in prin- 
ciple read licence plates on Jupiter’s moons.’ = 


CORRECTION 

The News story ‘Canada builds quake 
warning system’ (Nature 534, 446-447; 
2016) incorrectly stated that the warning 
system being developed by Ocean Networks 
Canada would be the first in Canada. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ask 


The ice historians 


TO TELL WHETHER GREENLAND’S GLACIAL CAP WILL MELT AWAY ANY 
TIME SOON, RESEARCHERS ARE PORING OVER OLD PHOTOGRAPHS AND 
DRAWINGS FOR CLUES TO ITS PAST BEHAVIOUR. 


BY QUIRIN SCHIERMEIER 
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nders Bjork could have stepped out from a painting of a 
nineteenth-century Arctic expedition. Tall, athletic and burning 
for outdoor action, he has been attracted to the wild fjords and 
glaciers of Greenland since his university days, when he took sum- 
mer tourists on guided tours there. He still ventures to the island every 
summer to measure its waning ice sheet — and he doesn't always travel 
by air. In 2014, he retraced the route of early Danish explorers by sailing 
to Greenland in a three-masted wooden schooner built in the 1930s. 

But on a pleasant spring day in Copenhagen, Bjork is carrying out a 
much safer reconnaissance. In the belly of Denmark’s national archive, 
he sifts through yellowed files filled with rows of daily weather and 
ocean-temperature measurements that Danish clergymen and village 
chiefs made decades ago along the Greenland coast. 

“I get a kick out of sitting over old maps and documents,” he says. “I 
somehow find that kind of stuff just as fascinating as doing field work” 
Bjork has alternated expeditions to the archives and to distant glaciers 
for ten years, ever since he first came across tens of thousands of aerial 
photographs of Greenland from the 1930s, lying all but forgotten in a 
seventeenth-century fortress outside Copenhagen. 


Danish geologist Lauge Koch 
(centre) and colleagues on the 
open-cockpit Heinkel plane 
that his team used to survey 
East Greenland in 1932. 
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Bjork and his colleagues at the Natural 
History Museum of Denmark in Copenha- 
gen are ice historians. They are combing the 
old records to document how Greenland’s 
ice sheet and glaciers have behaved since the 
nineteenth century — a crucial set of infor- 
mation for climate scientists trying to predict 
how it might change in the future. 

With Arctic temperatures rising faster than anywhere else on Earth, 
Greenland is now losing about 200 billion tonnes of ice per year and 
raising ocean levels around the globe. Projections suggest that melting 
from the island might swell sea levels by 30 centimetres by the end of this 
century. If all Greenland’s ice melted — a possibility over the next few 
centuries — it would push up sea level by more than 6 metres, enough to 
flood coastal megacities such as New York and Miami. 

But the projections carry large uncertainties, in part because researchers 
lack basic information about Greenland’s past. Satellite data only go 
back 40 years, which is why Bjork and his colleagues are poring over 
180,000 photographs and other data that record how glaciers have 
advanced and retreated during cold and warm spells in the recent past. 
Their first sets of findings suggest that Greenland ice has responded 
more strongly to past climate changes than was previously realized. 
Now, the researchers are trying to unravel what factors within the 
oceans, atmosphere and inside glaciers control their behaviour. 

What sets Bjork apart, say other scientists, is that he combines 
the heart of a seafarer with a strong sense of detail and creativity in 
research. The studies by him and his fellow ice historians are making key 
contributions to glacial science, says Beata Csatho, a glaciologist at the 
University of Buffalo in New York. “Collecting historical data sets is 
probably more important at this point than having yet another satellite 
do more of the same stuff” she says. “This is not incremental progress 
— it is a big advance in science.” 


EPIC ADVENTURES 

Bjork’s fascination with Arctic exploration reaches back to his 
childhood. Like many Danes, he learned to sail at a young age and over 
the years he embarked on a series of increasingly epic voyages that 
blended science and adventure. 

In Bjork’s most extreme expedition, he and a small crew of 
like-minded scientists sailed through the storm-tossed south Atlantic 
for two months in 2011. Svante Bjérck, a Quaternary geologist at the 
University of Lund in Sweden, had chartered a two-masted ship to sail 
from the Falkland Islands to Tristan da Cunha, the most remote inhab- 
ited island in the world. During the 13-day trip, Bjork spent several 
nights awake on deck, tied to the mast during gales, helping the captain 
steer the ship through huge waves. “It was scary. I’ve never seen anything 
like this,’ he says. The return journey to Uruguay took 33 days against 
relentlessly unfavourable trade winds. Yet the rich yield in climate data 
and the unforgettable adventure was well worth the ordeal, he says. 

The son of two school teachers, Bjork grew up in Svendborg, a small 
harbour town in southern Denmark with a long seafaring tradition. 
Eager to explore the world, he studied geography and became enthralled 
with Greenland, which has been part of the kingdom of Denmark since 
the early 1800s. While serving as a student assistant to Quaternary geol- 
ogist Kurt Kjzer of the Natural History Museum of Denmark, Bjork 
learned to mix traditional geology and expedition-era geography with 
modern high-tech data. 

His interest in historic records was piqued ten years ago when he 
first heard tales of a cache somewhere in Copenhagen filled with aerial 
images of Greenland taken in the 1930s during mapping expeditions. 
He eventually discovered that the formerly classified images had been 
locked away in a ramshackle citadel outside the city. Bjork sensed a 
unique research opportunity, but it took him years to sift through rows 
and rows of boxes filled with thousands oflight-sensitive negatives and 
to work out which might be useful. 

As he extended his search, he came across many more images taken 
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during Danish-US Greenland aerial surveys from the 1940s to the 
1970s. Then came pictures from Danish mapping expeditions carried 
out from 1978 to 1987. Bjork has now amassed a collection of 180,000 
aerial photographs (see ‘Flight lines’). But he has paid particular atten- 
tion to the first images he acquired, from the 1930s, because they 
chronicle a warm period that shares some similarities with the pre- 
sent. The mapping expeditions, led by the explorer Knud Rasmussen, 
were partly motivated by a dispute with Norway about which country 
had sovereignty over Greenland — a legal battle that the international 
court in The Hague settled in 1933 in favour of Denmark. But the crews 
were aware of the scientific potential of their work. “The instantaneous 
record contained in the air and terrestrial photographs of the pre- 
sent state of the glaciers of the entire coast is in itself of great value for 
future researchers,’ the Danish captain Carl Gabel-Jorgensen wrote in 
a 1935 report. 


“WITHOUT GPS OR OTHER 
PRECISION TOOLS THEY STILL 
MANAGED TO PRODUCE GLACIER 
DATA OF AMAZING QUALITY.” 


“These men were professionals,” says Eric Rignot, a glaciologist at 
the University of California, Irvine, whose group Bjork will join in the 
autumn as a postdoc. “Without GPS or other precision tools they still 
managed to produce glacier data of amazing quality” 

The image collection from the 1930s includes 10,000 overlapping 
aerial photographs captured with a rotating camera flown in an open 
hydroplane at 4,000 metres altitude. Bjork and his colleagues have used 
these images — alongside more recent aerial images and satellite obser- 
vations — to produce an 80-year record of how 132 glaciers in south- 
east Greenland have waxed and waned". They found that the glaciers 
retreated strongly during both warm periods but many have lost more 
ice during the record temperatures in recent decades. 

Not all glaciers have responded to warming in the same way. It has 
hit low-elevation glaciers harder than ones higher up, and glaciers that 
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1930s 
A territorial dispute with 
Norway prompted Denmark 
to conduct mapping 
expeditions in the 1930s, a 
relatively warm time when 
glaciers were retreating. 


482 | NATURE | VOL 535 | 28 JULY 2016 


terminate in the sea seem to be more vulnerable to the present rise in 
temperatures than those that end on land. Researchers are particularly 
concerned about how quickly these marine glaciers might shrink as they 
are attacked by the rapidly warming ocean. That may be a key factor in 
determining how quickly Greenland loses ice. 

The signs are not good. Hundreds of outlet glaciers that drain the 
ice sheet are losing substantially more mass and are thinning more 
than they are gaining through snow accumulation. Jakobshavn, one of 
Greenland’s largest glaciers, is moving faster than all the others and is the 
poster child for the fragile ice sheet, which some fear might approach a 
fatal tipping point as air temperatures continue to rise and the glaciers 
thin even more’. 

The findings from the historical studies released so far are a reminder 
that the current glacier retreat has lasted far longer and is more pro- 
nounced than the typical fluctuations over the past century. “It’s that 
kind of long-term perspective that helps us understand that the rapid 
changes we are seeing now are definitely more than just noise,” says 
Csatho, who has looked at satellite and aerial data to reconstruct how 
the surface elevation of the Greenland ice sheet changed at nearly 
100,000 locations from 1993 to 2012 (ref. 3). 


RECOVERED TREASURE 

Bjork’s work at the Arctic Institute in Copenhagen is more sedate than 
his trips to Greenland, but no less rewarding. The centre's archives hold 
treasured documents and memorabilia of Denmark’s Greenland expe- 
ditions, which are part of the Nordic kingdom's cultural DNA. Bjork 
gleams with the delight of an art lover as he unrolls maps drawn by Dan- 
ish explorer Hinrich Rink in 1851 of the mighty Jakobshavn. Carefully 
opening stitched albums, Bjork passes by snapshots of Inuit villagers 
with sealskin boats, and heads for the pages that most interest him — the 
sketches and images depicting the shapes of glaciers and their thickness. 

“Rink and later explorers documented very carefully where they were, 
and we can trust that what they painted is exactly what they saw,’ he says. 
“Their zeal and accuracy is a gift for us.” 

Bjork has done much of his work with fellow geographer Kristian 
Kjedlsen, with whom he shares an office in the Natural History Museum 
plastered with Greenland maps and Arctic photography. In a second 
study, they and their colleagues carefully picked out geographical fea- 
tures on old photographs, such as moraines of rock or vegetation lines 


Anders Bjark and his colleagues are using historical aerial surveys to assess Greenland's glaciers, 
including 309 major ones («) that drain the ice sheet. Blue lines represent flight tracks. 


1940s and 50s 
Many glaciers 
advanced during a 
regional cooling in 
the middle of the 
twentieth century. 


1960s and 70s 
Additional surveys 
filled in some gaps 
along the coastline. 
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Photographs of Ujaraannagq Valley in southwest Greenland in summer in 1936 (left) and in 2013 show that several glaciers have disappeared. 


that mark the greatest extent of the glaciers during the Little Ice Age at 
the end of the nineteenth century. 

They used these data, along with more recent aerial surveys, to deter- 
mine how the height of the glaciers has shifted, which provides a way 
to track changes in the ice mass. Their results indicate that the mean 
annual rate of ice loss between 2003 and 2010 was more than double the 
average for the twentieth century*. Moreover, striking spatial differences 
seem to confirm that the ice sheet’s response to changing climates is gov- 
erned by the topography of the underlying bedrock and by the geometry 
of the fjords through which outlet glaciers are flowing towards the sea. 


UNRULY ICE 

Greenland’s glaciers have been fickle over the past century, advancing 
in some places and retreating in others. By discovering when those 
changes happened and what kind of conditions prevailed at the time, 
Bjork and his co-workers hope to shed light on the ice sheet’s compli- 
cated mechanics — a missing piece in attempts to model its waxing and 
waning accurately. 

But nailing the ice sheet’s behaviour requires looking at as many 
individual glaciers as possible. This is just what Bjork and his group 
are doing now. Using all available historic records from Greenland, 
including pictures of glaciers in the least-explored high north, alongside 
modern satellite imagery, he aims to reconstruct the history of the 
island’s 309 biggest glaciers in unprecedented detail. 

The work is in full swing. On most days, two of his students toil 
away in the basement of the museum in front of computers, clicking 
on boulders, cliffs and other recognizable features that can be spotted 
in photographs taken decades apart. These are control points, which 
allow the researchers to match pictures captured at different alti- 
tudes and angles. Most aerial pictures were not taken looking straight 
down and hence lack a single scale. All of these oblique pictures need 
to be converted, one by one, to vertical views so that they can be 
transferred onto a common coordinate system. Without doing this 
geo-rectification, the researchers could not accurately measure the 
glaciers’ advances and retreats. 

Photos are not the only source of information. In the Arctic Institute 
ona sunny afternoon, Bjork looks at sketches that the German geologist 
Alfred Wegener produced during his last expedition, shortly before he 
died in November 1930 near an ice camp in central Greenland. Other 
albums hold illustrations of glaciers that scientists made during field 
trips in a cold spell in the late nineteenth century. 

So far, he has discovered about 600 sketches and paintings that might 
help to tell the story of Greenland’s glaciers before aerial photographs 


first became available. For some glaciers, official aerial pictures don't 
exist. To fill the gaps, he is also consulting privately held images that 
geologists have taken over the years. “New information is coming up 
from all sides,’ he says. 

In 2014, the daughters of two pilots involved in the 1930s surveys 
offered Bjork their large collections of photos and even an 8-millimetre 
film their fathers had taken during the campaigns. 

The emerging story is that ice disappeared very fast in the early 
twentieth century in the warming that followed the end of the Little 
Ice Age. Then the subsequent cool spell brought widespread glacial 
advances. The profound switch between shrinking and surging sug- 
gests that the glaciers are more sensitive to warmings and coolings than 
researchers had previously thought, says Bjork. But why some glaciers 
advanced forcefully at given periods and temperatures whereas others 
did not is still a puzzle. 

NASA has launched the Oceans Melting Greenland (OMG) project, 
led by Rignot, to provide glaciologists and ice-sheet modellers with 
unprecedented base maps of fjord bathymetry and other informa- 
tion needed to determine how glaciers interact with the sea. That is 
where one of Bjork's latest discoveries could prove useful. The historic 
records of sea-surface temperature that he unearthed can be combined 
with the individual histories of different glaciers to see how those that 
end in the ocean responded to changing marine conditions. The past 
behaviour of the ice, Rignot says, “matters a great lot when it comes to 
projecting its fate”. 

After all, the ultimate goal of this historical research is to look 
forward. Along with palaeoclimatic data from hundreds and thousands 
of years ago, the findings of Bjork and his colleagues from the recent past 
promise to increase confidence in the projections of ice-sheet models, 
says Richard Alley, a glaciologist at Pennsylvania State University in 
University Park. “We need history as well as modern observations to 
build and test predictive models.” 

For Bjork, the historical research goes well beyond science. It also 
connects him with the pioneering scientists and explorers he grew up 
admiring. He is grateful that their legacy is finally being dug out from 
the crypt. “It’s part of Nordic history,” he says, “anda real gift to modern 
science.” m 


Quirin Schiermeier writes for Nature from Munich, Germany. 
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BY EWEN CALLAWAY 


A lab in Belgium is using genetics to 
make the perfect beer yeast. 
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evin Verstrepen’s lab meetings can be pretty boozy affairs. Twice 
a week, several members of his group at Belgium's University 
of Leuven and the Flanders Institute for Biotechnology gather 
around a table loaded with black, tulip-shaped beer glasses, together with 
spit buckets and crackers. 

Verstrepen holds a glass and takes a whiff of its contents. “For me, this 
was an ethyl acetate bomb,” he pronounces, referring to a chemical found 
in pear-flavoured sweets that, at high concentrations, reeks of nail polish. 

Brigida Gallone, a graduate student in the lab, detects a second aroma: 
“ethyl acetate and 4- VG’, she says. That's 4-vinyl-guaiacol, which smells 
of smoke, cloves and — according to a tasting sheet in front of her — a 
dentist’s surgery. “I like 4-VG and this was too much for me.’ Another 
student, Stijn Mertens, catches a smell of wet cardboard, which is common 
to stale beers. “I got some trans-2-nonenal,’ he says. With that, the group 
finishes its analysis of this brew and moves onto the ninth and final glass. 
Itis not even 11 a.m.. 

“There's only so many you can do before you lose focus,’ says post- 
doc Miguel Roncoroni, who has been hosting these tastings for more 
than 4 months. They are part of a project to characterize some 200 com- 
mercially produced Belgian beers. Their assessments, alongside precise 
measurements of the dozens of chemicals that produce the flavours and 
aromas, could help consumers to identify new beers to try, by comparing 
the lab profiles to ones they like. 

But Verstrepen has loftier ambitions than helping beer lovers to select 
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their next bottle. He 
wants to build the 
perfect yeast. His lab 
is deploying what it 
is learning about the 
chemical and genetic 
basis of beer flavour to breed yeast strains that 
generate unique flavours and other qualities 
coveted by brewers and drinkers. 

The beer geeks in his lab straddle the worlds 
of curiosity-driven science and industrial brew- 
ing. They study evolution, biochemistry, and 
even neuroscience through yeast. But they also 
have contracts with beer makers worldwide, 
from multinational conglomerates to small 
trend-setting craft breweries. In an upcoming 
Cell paper, the lab will report the genomes of 
some 150 yeast strains used to make beer, sake 
and other fermented products, a project done in 
collaboration with a leading supplier of yeast to 
brewers and a synthetic-biology firm. 

For a US$500-billion industry whose 
products depend on complex interactions 
between chemistry and microbiology, sophis- 
ticated yeast strains are hot commodities. “You 
always want to know what's new in Kevin's lab,” 
says Peter Bouckaert, the brewmaster at New 
Belgium, a leading craft brewery in Fort Collins, 
Colorado. “People are watching what he does.” 

Beer gets its flavour from just a few ingre- 
dients (see “Better brewing through biology’). 
Grains — mainly malted barley — provide 
sugar and body, but can also imbue flavour, 
such as the chocolate notes common to dark 
stouts. Hop flowers bring bitterness as well 
as the tropical fruit notes in some craft beers. 
Dissolved minerals in the water influence the 
flavours that come through from grain and 
hops. And brewer's yeast, Saccharomyces cere- 
visiae, provides alcohol, bubbles and hundreds 
of aroma compounds. Fermentation produces 
everything from isoamyl acetate, which makes 
German Hefeweizens taste of banana, to the 
clove notes of 4-VG. 

Yeast science was spearheaded by beer 
makers. Denmark's Carlsberg brewery estab- 
lished one of the world’s first yeast-biology 
labs in 1875, and it was there that Emil Chris- 
tian Hansen isolated the first pure culture of a 
brewing yeast in 1883. In the 1930s and 40s, 
another Carlsberg scientist, @jvind Winge, 
discovered that yeast reproduces both sexually 
and asexually and used this insight to breed 
new strains with useful brewing traits. 

Winge’s work moved yeast from the brewery 
to biology labs, and many scientists now use 
brewer's yeast as a model to probe the inner 
workings of complex cells. But despite a long 
and fruitful marriage between yeast and bio- 
science, Verstrepen argues that many brewers 
are still stuck in the nineteenth century when it 
comes to the yeast they use. “Brewers, especially 
traditional brewers, are often not using the opti- 
mal yeast.” Most use just one strain — isolated 
from their brewery or borrowed from another 
decades ago — in all their beers. 


Kevin Verstrepen 
(centre) leads a beer- 
tasting session in his 
lab at the University of 
Leuven in Belgium. 


BETTER BREWING THROUGH BIOLOGY 
Four ingredients generally influence the 


Verstrepen wants to change that. He started 
out working in a South African wine yeast lab, 
then joined a University of Leuven beer lab for 
his PhD in 1999. But he was disappointed to 
find that most of the research involved trouble- 
shooting brewers’ problems. “Nobody was 
doing any biology, really,” he says. Disillusioned, 
he moved to the Whitehead Institute for Bio- 
medical Research in Cambridge, Massachusetts, 
to pursue a postdoc with Gerald Fink, who pio- 
neered genetic engineering in yeast in the 1970s. 

But although the scientists there liked yeast, 
nobody was interested in beer — at least not by 
day — Verstrepen says. So, his work centred 
on the proteins that pathogenic yeast deploy to 
stick to human tissue. Verstrepen discovered 
that the level of stickiness is dictated by the 
number of repeating DNA sequences ina cer- 
tain gene (K. J. Verstrepen et al. Nature Genet. 
37, 986-990; 2005). “It’s like having longer Vel- 
cro loops that stick more easily, he explains. The 
proteins are also responsible for flocculation, 
the process by which yeast cells clump together 
ina beer and fall out of solution. Flocculation 
varies between brewing strains and influences 
the flavour, clarity and alcohol content ofa beer. 

In 2005, Verstrepen moved across town to 
open a lab at Harvard University, focusing on 
the roles of various repetitive DNA sequences in 
generating diversity. He taught Harvard under- 
graduates biology in a course that incorporated 
brewing — “It was a pretty tough course,’ he 
says — but beer didnt feature in his research 
until he returned to Leuven in 2009. 

Verstrepen hoped to combine the kind of 
research he was doing with his interests in beer 
and wine. But it was a phone call from a Swiss 
chocolate company that jump-started his work 
with industry. Zurich-based Barry Callebaut, 
one of the world’s largest cocoa makers, needed 
help transforming bitter cacao beans into cocoa 
powder (which is traditionally done by yeasts 
present in the environment). “My response was 
‘Is chocolate fermented?’ That wasn't the smart- 
est thing to say,’ says Verstrepen. 

Nevertheless, the company became the first 
client of Verstrepen’s lab consultancy, which 
now earns around €500,000 ($555,000) per 
year. Half of the 25 or so scientists in his lab do 
applied research on beer, biofuels and other fer- 
mented products; the rest pursue epigenetics, 
molecular evolution and other basic research. 


LAGER LAB 

At first glance, the lab looks like any other. Cen- 
trifuges, petri dishes and pipettes cover benches. 
There is also an incubator full of small glass bot- 
tles. It would look at home in any microbiol- 
ogy lab, were the bottles not filled with a strong 
broth of malted barley, sugar and hops. The cold 
room offers more clues. “There's not too much 
beer in here now,’ Verstrepen says, pointing to a 
few full crates. And a stockroom contains hun- 
dreds of the black glasses used for blind tastings; 
they are opaque so tasters cannot see what they 
are sipping. > 


flavour and aroma of beer. 


Grains — usually barley — 

are moistened and allowed to 
germinate, and then dried and 
roasted. The resulting ‘malt’ 
imparts flavour that depends on 
the amount of roasting, which can 
make a beer taste of biscuits, 
coffee or chocolate. 


Water 


The malted grains are then 

milled, cooked in water and 
rinsed to produce a sugary broth 
known as wort. Depending on pH 
and mineral profile, the water used 
can dramatically shape the final 
flavour of the beer by altering the 
mouth feel and acidity. 


The wort is brought to a 
boil and hops are added. 
These flowers can generate a 
tremendous variety of flavours 
depending on the variety used 
and the timing of the addition. 
Beer owes its bitterness to hops, 
but it also gets citrusy, fruity 
and floral notes from them. 


The broth is cooled and 
yeast is added to produce 
alcohol and bubbles. Different 
yeasts can change the character 
of the beer dramatically, making 
it crisp or sour and imparting 
notes such as clove, grass, 
banana, leather, dried fruit or 
even horse blanket. 


TAPPING YEAST 


By-products of fermentation produce 
a wide variety of flavours and aromas, 
and their levels can be tuned by 
breeding new yeast strains. 
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> — Hislabs freezers house about 30,000 types 
of yeast, including 1,000 strains used by brew- 
ers, bakers and others worldwide, and another 
1,000 wild isolates from fruit, flowers, insects 
and even people. Many of these have been char- 
acterized for the genes known to influence taste 
and other traits that brewers care about. The lab 
is collaborating with yeast supplier White Labs 
in San Diego, California, and with Craig Venter’s 
Synthetic Genomics in nearby La Jolla to build 
a family tree of industrial yeast. 

The balance in the freezers is made up by 
the lab’s creations — completely new strains 
that have unique combinations of traits. The 
team makes them by mating different strains 
and then screening the offspring for the aro- 
mas they produce and, more recently, for genes 
that underlie these traits. This latter approach, 
known as genetic-marker-assisted breeding, is 
common in agriculture, and Verstrepen thinks 
that it will transform brewing. 

A Canadian brewery commissioned one 
of these custom strains when it wanted a full- 
flavoured Belgian ‘tripe! that has less alcohol 
than is typical for the style. Another brewery has 
asked for yeast that makes chocolatey aromas, a 
request that has stumped the lab so far. 


MASS MATING 
Using a robot that can accomplish hundreds of 
matings in a day, the lab generates many more 
strains than it can analyse or use in its taste tests. 
To cope with the glut, the researchers are devel- 
oping microfluidic chips that can brew with 
2,000 different yeasts at a time in 20-picolitre 
batches, each of which contains a single yeast 
cell. They can automatically test the resulting 
‘picobrews’ for alcohol content and hope even- 
tually to be able to measure the aroma com- 
pounds produced, too. At the other end of the 
spectrum, the lab will soon take delivery ofa kit 
to make 500-litre batches of beer to better appre- 
ciate the challenges of industrial-scale brewing. 
Verstrepen’s archive makes the lab a one-stop 
shop for brewers looking for unique flavours. 
For example, when Bouckaert wanted “funky” 
aromas, such as the smell of barnyards or horse 
blankets, but without the usual accompanying 
fruity notes, he tried four of the lab’s strains 
that showed some promise. None was going to 
work for a New Belgium beer — at least not yet. 
“Kevin's research is a little bit out there and on 
the edges of brewer's applications,’ Bouckaert 
says. “But that doesn’t mean it will not translate 
to something that could be huge in the future.” 
Natural variation in brewer’s yeast offers 
much leeway to dial up and down flavours 
and other traits, but the approach can only go 
so far, Verstrepen says. Genetic-modification 
tools could improve on that. “With breeding 
we can increase flavour 10-fold, with genetic 
modification we can increase it 100-fold or 
1,000-fold, and we’ve done this,” Verstre- 
pen says. “The beers you make are more like 
banana milkshakes. Is this something we really 
want to do?” Brewers are excited about the 
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work, but the stigma that surrounds geneti- 
cally modified (GM) foods means that the lab 
always uses more conventional techniques 
such as breeding and directed evolution to 
make strains destined for industry. 

And gene-editing techniques such as CRISPR 
could be used to introduce naturally occurring 
variants linked to flavour production into yeast 
strains that perform well but don't impart much 
flavour. This would accomplish the same goals 
as conventional breeding much more quickly. 
Food regulators are already grappling with the 
question of whether plants and livestock made 
this way count as GM foods, so it’s not incon- 
ceivable that CRISPR beers could force the issue. 


“THE BEERS YOU 
MAKE ARE MORE 
LIKE BANANA 
MILKSHAKES. 
IS THIS SOME- 
THING WE REALLY 
WANT T0 DO? 


A few craft breweries have asked the lab for 
GM yeast (they were turned down), but most 
of the brewing industry has little appetite for it, 
says Bouckaert. “American craft is pushing the 
boundaries, but on genetic modification, it’s a 
no-no,’ he says. Multinational brewers are even 
more skittish about being linked to GM beer. 

Leuven-based AB InBev, the world’s biggest 
brewing conglomerate and the maker of Bud- 
weiser and Stella Artois, confirmed that it works 
with Verstrepen. Philippe Malcorps, a senior 
scientist in the company’s yeast and fermenta- 
tion division, says that AB InBev is interested 
in tapping the diversity of wild yeasts to get fla- 
vours and other traits that would be impossible 
to find in classic brewing strains. 

Much of Verstrepen’s work with such compa- 
nies is covered by strict non-disclosure agree- 
ments. “Some of our best applied research, we 
cannot publish, we cannot talk about,’ he says, 
“That is frustrating as a scientist, because you 
want to share.” 

But smaller breweries tend to be more liberal. 
In fact, Orval, one of the six Belgian breweries 
run by Trappist monks, declined to sign a non- 
disclosure agreement or even a contract when 
it asked Verstrepen to sequence its house strain. 
“If you cannot trust monks any more, what are 
you going to do?” says Verstrepen (who keeps 
his personal refrigerator stocked with Orval, 
one of his favourite beers). Bouckaert says that 


he was offered exclusive access to the yeast 
strains his company was trying, but he balked. 
“T said, I’m acraft brewer. I don't want exclusiv- 
ity. 1 want you to send them to more people!” 

The applied work has led to scientific insights. 
Verstrepen’s team is still grappling with a project 
that originated from trouble-shooting a prob- 
lem it heard about from brewers. Often, brewers 
reuse yeast to cut costs. But periodically, they 
start new cultures, growing them first on glu- 
cose (yeast’s preferred food), then adding them 
to unfermented beer, which contains mainly 
maltose. Yeast sometimes responds sluggishly to 
the new food source, and it can take days before 
the beer is fermenting at full clip (and that leaves 
it prey to contaminants). “Yeast remember not 
only what they were eating, but also what their 
grand-grand-grandmothers were eating, up to 
five or six generations,’ Verstrepen says. Work- 
ing out the mechanism, which seems to involve 
the inheritance of epigenetic modifications to 
DNA or proteins, is now a focus of the lab. 

Often, the lab has found, good beer and good 
science go hand in hand. For his PhD project, 
Mertens was tasked with creating new strains of 
the yeast used to make lagers. Lager is brewed 
at cooler temperatures than other beers, using 
a yeast that emerged several centuries ago when 
S. cerevisiae hybridized with a related cold-tol- 
erant species called Saccharomyces eubayanus. 

Although responsible for the vast majority 
of beer sales, lagers tend to have a limited fla- 
vour range. To make yeasts that might expand 
this palette, Mertens coaxed various strains of 
the two species into mating. Some beers made 
with these yeasts tasted of onion, and many fer- 
mented poorly, but one combined the bracing 
crispness of a good pilsner (a pale lager) with 
hints of fruity aromas that are untypical of the 
style (S. Mertens et al. Appl. Environ. Microbiol. 
81, 8202-8214; 2015). The beer was so popular 
among lab members that bottles vanished from 
the cold room. Several breweries are now testing 
that strain, known as H29. 

Mertens would love to see a commercial 
beer made from his creation, but he also hopes 
to sequence the genomes of the other yeasts he 
has created to work out how species hybrid- 
ize — and maybe even unpick the conditions 
that spawned the first lager yeast. “We have 
new yeast and the brewers love it,” he says. “But 
we're looking at the fundamentals of how these 
hybrids work. That goes beyond beer science” 

By the end of the morning’s beer tasting, 
the spit buckets are filling up. Verstrepen has 
a meeting with a DNA-sequencing company, 
and Mertens and the other students have their 
research projects to get to. The lab might be a 
magnet for beer geeks, but it is no keg party. 

“Yes, youre working with a very fun 
product, but at the end of the day, it’s genetics 
work,’ Mertens says. “We're not drinking for 
fun.” At least, not until after work. m 


Ewen Callaway writes for Nature from 
London. 
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Lessons from Brexit 


Britain’s vote to leave the European Union is jeopardizing scientists’ funds, 
collaborations, staff and students; it has left the nation reeling and Europe 
vulnerable. These schismatic times have researchers worldwide soul-searching 
over how best to contribute. Five experts offer their reflections. 


JOHAN SCHOT 
Rethink social 
progress 


Director of Science Policy Research 
Unit, University of Sussex 


The popular rebuke to reason that was 
Britain’s vote to leave the European Union 
is a wake-up call. Our world requires an 
urgent rethinking of social progress. The 


sciences, social sciences and humani- 
ties should collaborate and open up their 
research agendas for public engagement 
and interdisciplinary dialogue to work 
towards a diversity of possible solutions to 
address the troubles of our time. 

In the debate about the consequences 
of Brexit, the EU is often conflated with 
Europe. But the start of international gov- 
ernance with a strong European outlook 
came long before — during the nine- 
teenth century — with industrialization 
and globalization. The EU is hence part 
of a greater ‘Europeanization process 


involving a web of multiple organizations 
and dependencies. Leaving the EU does 
not mean leaving Europe. 

The deliberations about Europe's future 
should therefore look more widely to the 
crucial question of how to organize and 
manage the international flows of goods, 
people, information, pandemics and pollu- 
tion — inevitable in our globalized world. 

Many have, rightly, criticized the EU for 
its technocratic character. Yet this is present 
in all international organizations — from the 
United Nations and the Council of Europe 
to the European Space Agency and > 
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>» CERN, Europeans particle-physics lab. 
These bodies have been built on the belief in 
the rule of experts — people like us — our 
rational decision-making and, if we're hon- 
est, our preference for avoiding overt politi- 
cal and public deliberation. 

It is time to reappraise these axioms of 
both European and global governance. 
Current designs reflect the old dream of 
building a European, or even global, state 
that would eventually surpass nation-states. 
Public support for this is clearly faltering: 
many issues need localized solutions. New 
constructs for local, national and inter- 
national governance are needed that com- 
bine technocracy and democracy to reduce 
the democratic deficit of which Brexit is a 
symptom. 

This is crucial if we are to address chal- 
lenging interconnected issues such as rising 
inequalities, unem- 


ployment, growing “We should 
migration, climate encourage 
change, human experimentation 
rights, arms control, with avariety of 
cyber-insecurity new democratic 
and terrorism. models.” 


Brexit must not 
simply prompt organizations in Europe to 
engage in yet another round of navel-gazing 
discussion focused on the future of their own 
interests, particularly funding and trade. 
Instead, we should encourage experi- 
mentation with a variety of new democratic 
models at all levels to create more partici- 
pative forms of policy-shaping. We should 
focus less on stimulating research, develop- 
ment and innovation across the board, as if 
it were all positive. Instead we must open up 
the debate about how knowledge production 
contributes to social progress to address the 
societal and environmental challenges of the 
twenty-first century. 


LEVENTE LITTVAY 
Compare populist 
movements 


Associate professor of political 
science, Central European University, 
Budapest 


The campaign that resulted in Britain 
voting to leave the EU was a populist appeal 
to the basest instincts of the economically, 
socially and culturally disenfranchised, 
fuelled by misinformation and a dismissal 
of any sincere debate of the potentially 
cataclysmic consequences. It follows the 
rise of populist parties using similar tac- 
tics in the 2014 European parliamentary 
election, and the unstoppable appeal of 
Donald Trump. We need to improve our 
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understanding of these convulsions. 

Systematic research on populism is still 
in its infancy. Social scientists are divided 
even on the definition, and on how it is dis- 
tinct from more established phenomena 
such as the radical right, nationalism and 
xenophobia. Country experts often have 
good contextual knowledge of relevant 
cases — from France’s Front National on 
the right, through Greece's Syriza on the 
left to the ideologically unidentifiable Five 
Star Movement in Italy. But scholars have 
only recently started to develop and col- 
lect high-quality data that are comparable 
across countries, such as the rhetoric of 
party leaders and the attitudes of their sup- 
porters. Such information will be crucial 
to understanding both the causes and the 
long-term consequences of populism. 

In the short term, populist rhetoric may 
have the positive effect of engaging citizens 
in the democratic process. But these citi- 
zens rarely become informed. Responding 
to misleading emotional appeals, they can 
end up voting against their own interests 
— as with those in some UK regions who 
voted in droves to quit the union, despite 
being in receipt of the greatest EU largesse 
and having the smallest numbers of immi- 
grants. In the longer term, populist politics 
can increase social polarization, erode trust 
and be profoundly damaging to society or 
the economy, as happened in oil-rich Vene- 
zuela, which is now practically a failed state. 

It is possible that better institutional 
design and more effective responses to 
populist rhetoric could mitigate the power- 
ful negative effects of such movements. In 
the absence of truly comparable data, we 
cannot yet assess what the most appropriate 
governance or communication strategies are 
to counter these politics. We do know what 
sort of research is needed. 

It is crucial to develop measures that 
can be compared across countries. These 
should identify politicians who are popu- 
lists, pinpoint which voters are suscepti- 
ble and predict their responses to types of 
counter-messaging. Collection of such data 
is a resource-intensive task, but necessary 
to better understand the conditions under 
which populism rises, to chart whether its 
effect (even in moderation) is necessarily 
damaging, and if so, to work out how to 
defeat it. 

In this way, we will be able to explore 
when and how past instances of populism 
— which have been studied for decades by 
Latin American scholars — are applica- 
ble today in Europe, the United States and 
beyond. Such studies will also determine 
to what extent one populist movement in 
Europe is analogous to another, even though 
they are often ideologically very different. 

One promising effort is Team Populism 
(populism.byu.edu). This consortium of 


researchers, directed by political scientist 
Kirk Hawkins at Brigham Young University 
in Provo, Utah, is working to improve com- 
parative assessments of populism. It brings 
together researchers from different levels 
of analysis (individuals, institutions, coun- 
tries), various methodological convictions 
(researchers of mass attitudes, behavioural 
experimentalists, scholars studying politi- 
cians and discourse analysts), and different 
areas of expertise (scholars of the radical 
right in Europe and the populist left in Latin 
America). Such endeavours should help us 
to tackle the alarming kind of politics that 
led to Brexit. 


PETER TURCHIN 
Mine the past 
for patterns 


Professor in the departments of 
ecology and evolutionary biology, 
anthropology and mathematics, 
University of Connecticut 


Getting people to cooperate in very large 
groups such as the EU is difficult. Under- 
standing how humans have been able to cre- 
ate cooperative societies is also hard, because 
we cannot readily run experiments. Never- 
theless, much progress can be achieved by 
taking a scientific approach to analysing 
historical data. 

Asan example, the EU’s rapid expansion 
from the original group of 6 states to the cur- 
rent 28 has clearly contributed to its dysfunc- 
tion. Historians have a name for it: “imperial 
overstretch” (a classic example is the bloated 
empire of Alexander the Great, which frag- 
mented soon after his death). 

Dysfunction arises because, first, it’s 
easier for six people (or six heads of state)to 
converge on a mutually agreeable course of 
action than for 28 to do so. Second, expan- 
sion beyond the western European ‘core’ 
brought together people, and politicians, 
from diverse cultures with different values, 
taking incompatible paths towards coop- 
eration. Such normative and institutional 
mismatch created extra barriers to collec- 
tive action. 

Would European integration be bet- 
ter served by a more ‘modular’, step-wise 
approach? For example, central European 
countries already have their own ‘integration 
nucleus’ — the Visegrad Group of Poland, 
Hungary, Slovakia and the Czech Repub- 
lic. Perhaps the EU would work better as a 
nested set of such groups rather than one 
large one that relies on informal arrange- 
ments between the more powerful states? 

Such hypotheses need testing, empiri- 
cally and systematically — with massive 
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historical databases that thoroughly sample 
the historical record (see, for example, the 
Seshat: Global History Databank; seshatda- 
tabank.info). Here are some of the questions 
that need to be asked. 

What administrative arrangements and 
political institutions aided cooperation in 
large empires (which often started as con- 
federations), such as Rome, the Maratha 
Confederacy on the Indian subcontinent 
and the United States? What can we learn 
from the fate of the Habsburg Empire — 
the previous, failed, attempt at a ‘European 
Union, formed through a series of dynastic 
marriages? Does gradual, incremental con- 
struction result in a longer-lasting union? 
What kind of hierarchy of political units 
works better: a flat one with a single level, 
or anested, multilevel one? 

This sort of analytical, predictive history 
— known as cliodynamics — can inform 
the design of a better, more cooperative 
EU (P. Turchin Nature 454, 34-35; 2008). 
But there is a marked tendency among 
policymakers to deal with the economic 
and political crises of today as though they 
were unprecedented, leading them to repeat 
old mistakes. 

The first step should be to invest massively 
in research on how human cooperation at 
large social scales has been achieved in the 
past. We all have a huge stake in European 
peace, prosperity and collaboration — even 
those outside Europe. We shouldn't just leave 
it to the politicians. 


STEVEN J. BRAMS 
Offer more voting 
options 


Professor of politics at New York 
University 


Of the many things that the United King- 
dom’s referendum has demonstrated, one 
is that voting, even by a plebiscite, need not 
be so crude. The stark choices offered to 
UK voters — remain in the EU or leave — 
belied the complexity of the issues involved. 
Instead, the British public should have been 
offered the choice of different levels of par- 
tial separation — three, say — bracketed by 
the two all-or-nothing options, giving them 
five choices in all. 

True, more choices mean that a vote will 
be more finely divided, allowing one option 
to win with as little as 21% of the vote instead 
of the 52% that actually occurred. A system 
called approval voting obviates this problem, 
however, by allowing voters to indicate all 
courses of action they consider acceptable. 
The one with the greatest number of approv- 
als wins. 


The approach tends to find the most 
acceptable option over all, not the one with 
the strongest minority support that benefits 
from a divided field. Typically, the winner is 
a compromise, which receives support from 
voters on both sides of the ideological divide. 
It is one that most voters can live with, even 
if it is not their first choice. 

Unlike voting systems in which vot- 
ers can rank alternatives, which are used 

in such countries as 


“Voting,even Australia and Ire- 


by aplebiscite, land, approval voting 
neednotbeso  isagrading system — 
crude.” with only two grades 


(approve or don’t 
approve). It avoids a serious problem with 
ranking systems in which there may be a 
cycle of top choices (majorities prefer A to 
B, B to C, and C to A); these cyclical majori- 
ties leave unclear which alternative should 
be the winner. Under approval voting, if A 
beats B, and B beats C, A beats C. So, except 
for when the result is tied, there is always 
an alternative that is approved more than 
any other. 

Approval voting is now used to elect 
officers in several major scientific socie- 
ties, including the American Mathematical 
Society and the Mathematical Association 
of America, each with tens of thousands of 
members. At my university, this system is 
used by the politics and economics depart- 
ments to choose a chair. All tenured pro- 
fessors are candidates; the vote invariably 
elects candidates who bridge factions in the 
departments. In the US state of Oregon in 
1990, approval voting was used in an advi- 
sory referendum on financing education. 
Voters could choose from five different lev- 
els of sales and income taxes, but because 
it was advisory, no choice was mandated. 

If Britain’s exit does precipitate referen- 
dums elsewhere in Europe, citizens should 
be given more than two options. This mim- 
ics the multiple choices that members of 
parliament typically have when they add 
and remove provisions from bills, pro- 
moting compromises. Shouldn't voters be 
afforded the same opportunities as their 
representatives? 


SIMON GACHTER 
Study how groups 
collaborate 


Professor of the psychology of 
economic decision-making, 
University of Nottingham 


Britain’s decision to quit the EU can be 
interpreted as scepticism towards deep 
multilateral cooperation. This raises a 


fundamental question for social-science 
research: under what conditions can 
groups of people cooperate for the com- 
mon benefit? So far, behavioural eco- 
nomics has overwhelmingly focused on 
investigating people’s willingness to pull 
their weight for the benefit of their group 
— for example, how fishermen work 
together to secure the biggest catch for a 
village. But in a multilateral international 
cooperation, such as the EU, groups — or 
nations — work together to secure gains 
that exceed the possibilities of any indi- 
vidual nation going it alone. European 
research collaboration and the EU carbon 
market are prime examples. 

Coalitions of diverse groups are 
necessary to address global challenges, 
but they are always jeopardized by the 
free-riding of groups who consider their 
self-interest — possibly fuelled by strong 
group identity — to be the only relevant 
criteria for collective decision-making. 
Witness the struggle to forge international 
climate agreements. Strong group identi- 
ties can help people with a common sense 
of purpose to cooperate for example, in 
local communities, companies, univer- 
sities, army units or sports teams. But 
such tribalism may undermine teamwork 
between groups, in particular when the 
costs of working together seem to be more 
salient than its benefits — as seems to be 
increasingly the case with citizens’ views 
of European cooperation. 

New experimental studies should in 
particular investigate the influence of 
identity on collaboration within and 
between groups when collective benefit 
and self-interest are in tension, or seem 
to be so. 

Urgently needed are good data to validate 
pundits’ hunches about people's motivations 
in voting in the referendum, and how those 
motives interacted with sociodemographics 
and social networks on and offline. For 
example, how important were immigra- 
tion and identity? Which other considera- 
tions mattered for whom, where and why? 
There is impressionistic evidence about 
some motives but there are not enough hard 
data from large-scale representative surveys. 
These might also include simple behavioural 
experiments to probe the role of basic behav- 
ioural tendencies such as impatience, risk 
aversion, reciprocity, altruism and trust in 
strangers. 

Answers to these questions are immedi- 
ately important for evidence-based policy 
responses in the United Kingdom and in 
the EU. More broadly, understanding the 
relative importance of economic, socio- 
logical and psychological determinants 
of people’s propensities to support trans- 
national cooperation is crucial in an age of 
globalized challenges. m 
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Mary Shelley, painted around 1840 by Richard Rothwell and housed in the National Portrait Gallery. 


The science that 
ted Frankenstein 


Richard Holmes ponders the discoveries that inspired the 
young Mary Shelley to write her classic, 200 years ago. 


what many view as the first true work 
of science fiction — and unleashed one 
of the most subversive attacks on modern 
science ever written. Eighteen-year-old 
Mary Godwin (as she then was) had the 
idea for Frankenstein, or The Modern 


I n 1816, a teenager began to compose 
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Prometheus that summer, while at the 
Villa Diodati on Lake Geneva in Switzer- 
land, with her lover and future husband 
Percy Bysshe Shelley, and his friend and 
fellow poet Lord Byron. Forced inside 
by stormy weather, the group spent wild 
evenings telling ghost stories, while 


Byron's personal physician, the brilliant 
20-year-old John William Polidori, regaled 
them with reports of the latest develop- 
ments in medical science. 

Mary’s inventive mind was peculiarly 
primed to grapple with both literary and 
scientific controversy. Her mother was the 
feminist writer Mary Wollstonecraft, who 
had died from complications after Mary’s 
birth. Her father was anarchist philoso- 
pher and novelist William Godwin, whose 
friends included chemists and pioneering 
electricity researchers Humphry Davy 
and William Nicholson, and the opium- 
addicted poet Samuel Taylor Coleridge. 
These influences shaped her youthful 
thinking, and were encouraged by Shelley, 
who had dabbled in science at the Univer- 
sity of Oxford before being thrown out for 
atheism. 


GOTHIC DRAMA 

The myth of Victor Frankenstein, the 
crazed but idealistic young scientist who 
unwittingly lets loose his monstrous crea- 
tion and struggles to accept responsibility, 
is a heady cocktail of gothic melodrama 
and disturbing speculation. It has proved 
astonishingly adaptable. The first theat- 
rical version, Presumption: or the Fate of 
Frankenstein, opened at the English Opera 
House in London in 1823, to huge audi- 
ences and scandalous publicity (“Do not 
take your wives, do not take your daugh- 
ters, do not take your families”). Mary 
Shelley attended, noting that “in the early 
performances all the ladies fainted and 
hubbub ensued!” There have been more 
than 90 dramatizations since, including 
the Danny Boyle-directed 2011 produc- 
tion at London’s National Theatre, which 
opened with the Creature dropping naked 
from a huge, pulsating artificial womb. 
The story has also been adapted for more 
than 70 films, including James Whale’s 
iconic 1931 Frankenstein starring Boris 
Karloff. In May this year, a Frankenstein 
ballet was staged at the Royal Opera House 
in London. Choreographer Liam Scarlett 
shrewdly analysed it as a love story: “The 
Creature is like an infant. He’s desperately 
seeking a parent or loved one to take him 
through the world.” 

Although the myth is well known, the 
original novel is not. There are three ver- 
sions. Mary Shelley began to write the first, 
probably as a short story, in two notebooks at 
Villa Diodati, expanding it during the winter 
of 1816-17 in simple direct prose of great 
intensity (the notebooks remained unpub- 
lished until 2008). The second, lightly edited 
by her husband and more literary in manner, 
was published in 1818. The third was radi- 
cally revised by Mary Shelley alone, and was 
published in 1831, with a fascinating new 
introduction by her. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


PHOTO: TARKER/BRIDGEMAN IMAGES 


With each version, the basic plot remains 
the same, but the tone grows darker. Frank- 
enstein becomes more passionate and ambi- 
tious, his science becomes more sinister 
and misdirected (“TI felt as if my soul were 
grappling with a palpable enemy”) and his 
Creature becomes more alienated and ago- 
nized. The 1831 introduction also contains 
an inventive, retrospective account of the 
storytelling competition at the villa. Mary 
now calls the book her “hideous progeny’, 
and claims that the whole idea came to her 
instantly, like an emotional bolt of summer 
lightning on waking from a terrible night- 
mare. “I saw — with shut eyes but acute 
mental vision — I saw the pale student of 
unhallowed arts kneeling beside the thing 
he had put together. I saw the hideous phan- 
tasm of a man stretched out, and then, on 
the working of some powerful engine, show 

signs of life, and stir 


“The early with an uneasy, half 
chapters evoke vital motion.” 
the mysteries The book may, 


of experiment, however, have 
native excitement had a more intel- 
about electrical _lectual genesis. 


kites and the The best contem- 
fascination of porary account 
air pumps.” of the ghost-story 


competition is 
Polidori’s. A medical graduate of the Univer- 
sity of Edinburgh, he had written his doc- 
toral thesis on sleepwalking. Before the trip, 
he was commissioned by the publisher John 
Murray to keep a secret journal of Byron’s 
adventures, and in this he recorded the villa 
party’s speculative conversations and read- 
ing of German gothic “horror tales”. Above 
all, he noted their wide-ranging discussions 
of fundamental scientific principles, and 
whether the human body “was thought to 
be merely an instrument”. As Polidori put it, 
their brains “whizzed”. 


SCIENCE FACT 
Polidori would have known about recent 
experiments in electrical resurrection 
techniques by Italian physicist Giovanni 
Aldini (nephew of bio-electrician Luigi 
Galvani), and the new anatomical theories 
of German physiologists such as Johann 
Friedrich Blumenbach. Also making waves 
were the fierce ‘vitalist’ debates at England’s 
Royal College of Surgeons between John 
Abernethy and William Lawrence, about 
the possible existence of an electrical ‘life- 
force and the unique nature of human 
consciousness. These controversial ideas, 
alive in the great universities and research 
centres of Europe, fed into Frankenstein, 
and especially into the moral issues that it 
raised about the perils of scientific interfer- 
ence with nature. 

Thus began a writing process involving 
careful research over many months. p> 
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Books in brief 


The Tale of the Axe: How the Neolithic Revolution Transformed 
Britain 

David Miles THAMES & HUDSON (2016) 

This illuminating treatise on the Neolithic era in Britain treats the 
polished-stone axe that gives the age its name as a portal into 
prehistory — a revelation of material, manufacture and function. 
Drawing on research riches from Turkey’s Catalhoyiik site to Britain’s 
Stonehenge, archaeologist David Miles contextualizes his core 
chronicle of how tools, farming and metallurgy arrived in the British 
Isles. As layered as the strata of an archaeological dig, this is a moving 
portrait of a people at a cultural and technological tipping point. 


Nature in the City: Bengaluru in the Past, Present, and Future 
Harini Nagendra OXFORD UNIVERSITY PRESS INDIA (2016) 

With 10 million people and pell-mell development, Bengaluru 
(India’s Silicon Valley, also known as Bangalore) is an old city in 
thoroughly modern flux. Urban ecologist Harini Nagendra’s study 
looks at its deep ecological history, colonial role as India’s garden city 
and current struggle with pollution, social exclusion and residents’ 
increasing detachment from nature. Marshalling research from 
satellite imaging to interviews with slum dwellers, she concludes that 
“cities need to be ecologically as well as socially smart”, and sees 
solutions in cross-city engagement of governance and civil society. 


Water in Plain Sight: Hope for a Thirsty World 

Judith D. Schwartz ST MARTIN’s (2016) 

Water security demands holistic, ecosystem-oriented solutions, 
argues Judith Schwartz in this stellar global tour of innovative soil and 
biodiversity restoration and water harvesting. In Zimbabwe, ecologist 
Allan Savory reveals how intensified grazing by wild ruminants is 
enabling 95% of rainfall to soak into the soil, and rivers to recover. In 
Brazil, researcher Antonio Nobre exposes how deforestation damages 
the Amazon’s unparallelled “forest-rain dynamics” and promotes 
drought. And in the Texas desert, permaculturalist Markus Ottmers 
unveils a built “ecosystem fuelled by variants of dew”. Inspiring. 


The Grid: The Fraying Wires Between Americans and Our Energy 
Future 

Gretchen Bakke BLOOMSBURY (2016) 

The US electricity grid, cultural anthropologist Gretchen Bakke 
reminds us in this cogent study, dominates US energy but is extremely 
vulnerable — and not just to gnawing squirrels. Nationalized and 
predicated on power plants, it’s a poor fit with the variable, localized 
output of renewables. Bakke traces it inception by pioneers such as 
business magnate Samuel Insull through its technological, political 
and industrial evolution. Working towards a “self-healing, processor- 
dense ‘intelligent’ grid”, she argues, is the key to energy resilience. 


And Soon | Heard a Roaring Wind: A Natural History of Moving Air 
Bill Streever LITTLE, BROWN (2016) 

As his 2009 Cold and 2013 Heat (both Little, Brown) attest, biologist 
and nature writer Bill Streever is drawn to extremes. He now tackles 
strong winds, from cyclones to Santa Anas, for a scientific history 

of storms, meteorology and wind power, studded with pioneers 
such as seventeenth-century astronomer and trade-wind mapper 
Edmond Halley. A chronicle of Streever’s voyage under sail from 
Texas to Guatemala is threaded through, giving a breezy immediacy 
to the story of how we learned to decode “moving air”. Barbara Kiser 
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Shelley first mentions this in her journal 
for 24 July 1816. She was in Switzerland 
while walking above Chamonix towards 
Mont Blanc, absorbing the bleak land- 
scape of the Mer de Glace glacier that would 
later fill the book’s central confrontation 
between scientist and Creature. “Nothing 
can be more desolate than the ascent of this 
mountain ... we arrived wet to the skin... I 
write my story”. Her notes on triumphantly 
completing the first draft, “Transcribe and 
correct F[rankenstein] ... Finish transcrib- 
ing” do not appear until April and May 
1817, just four months before the birth of 
her third child, Clara. It is no accident that 
metaphors of pregnancy, birthing and par- 
entage suffuse this novel about the creation 
of life. 


STREAMS OF INFLUENCE 
In the intervening period of composi- 
tion, back in England, Mary Shelley’s 
journal reveals an impressive reading list. 
She absorbed the extreme accounts of 
polar exploration in George Anson's 1748 
Voyage Round the World; the distinction 
between alchemy and chemistry in Davy’s 
1812 Elements of Chemical Philosophy 
(based on his famous London lectures); 
and the new concepts of brain develop- 
ment explored in Lawrence's physiological 
lectures, given in 1816-17. In Coleridge’s 
1798 poem Rime of the Ancient Mariner, 
she encountered the psychology of guilt 
and abandonment; in John Milton’s 1667 
Paradise Lost, the theme of the demonic 
outcast. Her husband also made clear, in 
his anonymous preface to the 1818 edition, 
that they had discussed the scientific 
poetry of Erasmus Darwin, in The Temple 
of Nature, or The Origin of Society (1803). 
Everything she devoured was brilliantly 
recast as a new genre: science fiction. 

Thus, Davy’s lectures at London’s Royal 
Institution were subtly transposed, some- 
times almost phrase by phrase, into those of 
the fictional Dr Waldman, praising the work 
of contemporary scientists to young Frank- 
enstein. “These philosophers ... penetrate 
into the recesses of nature, and show how she 
works in her hiding places. They ascend into 
the heavens; they have discovered how the 
blood circulates, and the nature of the air we 
breathe. They have acquired new and almost 
unlimited powers; they can command the 
thunders of heaven, mimic the earthquake, 
and even mock the invisible world with its 
own shadows.” 

From her first draft, Mary had devised 
a complex structure that nests three auto- 
biographical narratives one within the 
other like Russian dolls, each bringing a 
different interpretation to the Franken- 
stein myth. The first, often overlooked in 
adaptations, is by polar explorer Robert 
Walton. Told in the form of letters to his 
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Frankenstein’s monster in the book’s 1831 edition; 
played by Boris Karloff in 1931; and in a 2016 ballet. 


sister, it bookends the novel in the Arctic 
Ocean, and presents a moral enigma. Is 
the idealistic young Frankenstein essen- 
tially philanthropic, blindly ambitious or 
simply insane? And is his Creature evil or 
innocent — an ugly outcast or a persecuted 
victim longing for love? 

The second autobiography is Frank- 
enstein’s own, particularly his thrilling 
discovery of the deep “enticements of 
science”. These early chapters are among 
the first fictional presentations of the 


education of a young scientist, evoking the 
mysteries of experiment, naive excitement 
about electrical kites and the fascination of 
air pumps. Brilliantly transformed in the 
1831 edition, these become more sophis- 
ticated references to galvanism, the neces- 
sity of mathematics, the genius of Isaac 
Newton and the intoxicating delights and 
dangers of charismatic science lecturing. 

The third narrative, dramatically held 
back until halfway through, is the Crea- 
ture’s. Written in a wholly different stylistic 
register, it swings violently between des- 
perate exclamations, poignant appeals and 
furious menacings. In the great showdown 
with Frankenstein on the Mer de Glace, the 
Creature begs the scientist to delve further 
into experimentation to create a female com- 
panion whom he can love. 

Faced with this terrible ethical dilemma, 
Frankenstein agrees: this second creation 
scene, ina secret laboratory on the Orkney 
Islands off northeast Scotland, is also often 
overlooked. Fearful of the consequences, 
he destroys his female creation at the last 
moment, turning the disappointed Crea- 
ture into a vengeful demon. So emerges 
the central drama of the novel. It is not 
merely the creation of life itself, the tech- 
nical ambition of science, that is called 
into question. It is the unfolding moral 
choices and unforeseen ethical respon- 
sibilities that may come with scientific 
advances: artificial intelligence or artificial 
life, nuclear power or nuclear weaponry, 
the genome sequence or invasive genetic 
editing. 

One added irony makes Shelley’s novel 
much greater than any film — and greater 
indeed than its popular interpretation as 
an anti-science myth. It is that in these 
exchanges, paradoxically, the Creature 
becomes even more expressive and human 
than Frankenstein. He produces arias of 
speech, begging for justice, understand- 
ing, compassion and human rights. In 
the encounter in the Alps, the Creature 
declares himself Frankenstein’s unique 
responsibility: “I ought to be thy Adam, but 
I am rather the fallen angel, whom thou 
drivest from joy for no misdeed ... Every 
where I see bliss, from which I alone am 
irrevocably excluded ... Misery made me 
a fiend. Make me happy, and I shall again 
be virtuous.” 

That is the enduring youthful genius and 
imaginative generosity of Mary Shelley’s 
Frankenstein. It proclaims that the alien, 
the outcast, the rejected, finally must have 
claims on our humanity. And claims on our 
science, too. m 


Richard Holmes is the author of The Age 
of Wonder, which won the 2009 Royal 
Society Prize for Science Books. 

e-mail: richard. holmes. biog@gmail.com 
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Fresh strategies to 
save orangutans 


The Bornean orangutan (Pongo 
pygmaeus) was listed as critically 
endangered by the International 
Union for Conservation of 
Nature this month, despite 
decades of conservation efforts. 
We urgently need fresh strategies 
to counteract habitat loss and 
hunting, and to mitigate the 
impacts of climate change. 

Both species of Asia's only 
great ape (P. pygmaeus and the 
Sumatran orangutan, P abelii) 
now face ‘an extremely high 
risk of extinction. Safeguarding 
protected areas and enforcing 
existing bans on orangutan killing 
are still important, but such 
efforts are insufficient in isolation 
(E. Meijaard et al. Ann.N. Y. Acad. 
Sci. 1249, 29-44; 2012). 

We need to maximize the 
conservation value of degraded 
lands and use an integrated, 
landscape-scale approach to 
reconcile economic development 
and biodiversity conservation. 
And we must promote sustainable 
management practices that are 
backed by rigorous, transparent 
certification mechanisms and that 
incorporate explicit economic 
valuation of the ecosystem 
services provided by orangutans 
and their habitats. 

Crucially, success will also 
demand direct engagement 
and collaboration between 
conservationists and the mining, 
forestry and oil-palm industries. 
Andrew J. Marshall University of 
Michigan, Ann Arbor, USA. 
Serge Wich Liverpool John 
Moores University, UK. 

Marc Ancrenaz Borneo Futures, 
Kota Kinabalu, Sabah, Malaysia. 
ajmarsha@umich.edu 


Vet reproducibility 
of biology preprints 


Posting preprints in online 
repositories is common practice 
in the physical sciences and 
mathematics. It has been less 
satisfactory for preprints in the 
biological sciences, perhaps 


because the general standard of 
quality is inferior. Addressing the 
credibility of these submissions 
could bring free preprint servers 
such as bioRxiv (founded in 2013; 
bioRxiv.org) more into line with 
arXiv (arxiv.org), which has been 
running successfully for 25 years. 

Introducing a staging process 
for submission could rectify the 
credibility problem. This would 
establish priority for new work 
and allow time for it to earn a seal 
of approval — thereby helping 
to speed publication after formal 
submission to a journal. 

Initially, an abstract would 
be assessed by the editors of 
the preprint server to confirm 
suitability for posting. Submission 
ofa short form of the paper would 
follow, with added details of raw 
data, materials and methods; this 
would allow other researchers 
to confirm, refute or comment 
on the results. After 6 months, 
the authors could submit a 
revised preprint in response 
to this feedback, together with 
evidence to support the paper's 
credibility and reproducibility; 
alternatively, they could withdraw 
the manuscript with dignity. 
Roy Calne University of 
Cambridge, UK. 
ryc1000@medschl.cam.ac.uk 


SDGs: diseases that 
neglect no goals 


The mapping of interactions 
between the United Nations’ 
Sustainable Development Goals 
(SDGs) should guide efforts 

to leave no one behind (see 

M. Nilsson et al. Nature 534, 
320-322; 2016). 

Take neglected tropical diseases 
(NTDs; go.nature.com/29te2hu) 
and the SDG target to end 
epidemics by 2030. Doing so will 
have a positive interaction with all 
SDGs, including those directed 
at poverty, hunger, education, 
work, growth and inequality 
(C. Fitzpatrick and D. Engels Intl 
Health 8 (S1), i15—i18; 2016). The 
example set by the largest-ever 
ongoing donation of medicines 
from drug firms (see go.nature. 


com/2ag1gnf) will help to realize 
the public-private partnership 
goal for health. 

Community-led distribution 
of those medicines to more 
than 1 billion at-risk people is 
reinforcing other goals, such 
as women’s empowerment, 
logistics infrastructure and 
non-discrimination against 
disability. Curbing mosquito- 
borne NTDs contributes to the 
goals of urban sustainability and 
resilience to climate change. Safe 
use of insecticides supports the 
goal of sustainable ecosystems. 
Controlling water- and animal- 
related NTDs is facilitating the 
goals of small-scale fishing and 
sustainable hydroelectricity and 
biofuels. 

The World Health Organization 
is monitoring some of the most 
important positive interactions. 
For example, we are using NTDs 
to highlight inequalities, to target 
investment and to track progress 
towards universal access to water, 
sanitation and hygiene (see 
go.nature.com/29tpeph). 
Christopher Fitzpatrick, 
Mathieu Bangert, Dirk Engels 
WHO, Geneva, Switzerland. 
fitzpatrickc@who.int 


SDGs: create a 
coordinating body 


The United Nations’ Sustainable 
Development Goals (SDGs) seem 
to be a disjointed hodge-podge of 
aspirations. I suggest that a ‘panel 
of panels’ would help to guide 
policymakers. This would collate 
the flow of information from 
existing UN panels to highlight 
connections between the SDGs 
(see also M. Nilsson et al. Nature 
534, 320-322; 2016). 

This meta-panel could be 
modelled on international 
science platforms that coordinate 
multilateral environmental 
governance regimes, such as those 
for climate and biodiversity (see 
also P. M. Haas and C. Stevens in 
Governing the Air (eds R. Lidskog 
and G. Sundgqvist) 125-161; 
MIT Press, 2011). It would 
similarly comprise independent 


experts from various countries 
and disciplines, drawn from 
academia, civil society and the 
private sector to capture different 
perspectives on sustainability. 
The panel should be properly 
funded and not depend ona 
single funding source. It would 
meet regularly and report 
directly to governments through 
the UN High Level Political 
Forum. Its findings would need 
to be especially clear in areas of 
consensus and controversy. 
Peter M. Haas University of 
Massachusetts Amherst, USA. 
haas@polsci.umass.edu 


Avery preventable 
mammal extinction 


James Watson blames climate 
change for the extinction of 

the Bramble Cay melomys, or 
mosaic-tailed rat (Melomys 
rubicola; Nature 534, 437; 2016). 
In our view, simple management 
interventions could and should 
have saved this rodent. 

With a population of at most a 
few hundred and a range limited 
toa single, small island (less than 
5 hectares) that is susceptible to 
inundation in storm surges, the 
melomys was already living on 
the brink of extinction. Yet not 
an iota of effort was made, or 
resources committed, to reduce 
its extreme vulnerability. 

Perhaps because the species 
lacked phylogenetic, cultural 
or ecological significance, 
responsible agencies seemed 
to treat it as inconsequential. A 
recovery plan developed in 2008 
did not include plausible options 
for captive breeding or for 
translocation to nearby islands, 
and was never implemented. 

Neglect and inaction caused. 
this extinction — climate change 
was one of many threats that 
could have delivered the final 
blow. Such disregard must not be 
allowed to drive more extinctions. 
John C. Z. Woinarski* Threatened 
Species Recovery Hub, Australia. 
john.woinarski@cdu.edu.au 
*On behalf of 4 correspondents (see 
go.nature.com/2aelgaj for full list). 
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HUMAN PERCEPTION 


Amazon music 


The people of a tribe called the Tsimane’, who have been isolated from Western music, perceive music differently from 
Western listeners, raising questions about whether musical preference is innate or cultural. SEE LETTER P. 547 


ROBERT ZATORRE 


eople have been making music for at 
least 35,000 years’. The remarkable 


diversity of musical expression across 
cultures raises the question of how the brain 
perceives musical sounds. Deep in the Bolivian 
Amazon lives a group of people, the Tsimane; 
whose music is largely isolated from Western 
influences. On page 547, McDermott et al.” 
report that the Tsimane’ perceive harmony — 
the relationship between multiple simultane- 
ous tones — differently from Western listeners. 

Concepts of ‘consonance and ‘dissonance in 
Western music are based on frequency relation- 
ships between simultaneous tones. Periodically 
vibrating sources such as the human vocal cords 
give rise to sound at a fundamental frequency, 
produced by oscillation of the whole source, 
and at integer multiples of that frequency, cor- 
responding to vibrations of half the length of 
the source, one-third its length and so on. This 
physical principle has long been thought to be 
a determinant of how listeners perceive har- 
mony. For instance, tones whose frequencies 
are in simple integer ratios to one another, such 
as 3:2 (called the perfect fifth) or 5:4 (the major 
third), are considered consonant and more 
pleasant than dissonant ones such as 16:15 (the 
minor second). If this hypothesis is correct, then 
all listeners, regardless of their culture, should 
prefer similar tone combinations. 

However, the authors found that the 
Tsimane’ express no preference for consonance 
over dissonance. This result was replicated 
using several different stimuli, even disso- 
nant versions of Tsimane’ songs. In a clever 
control, the experimenters showed that the 
Tsimane’ do prefer pleasant vocalizations, 
such as laughter, to unpleasant gasps, indi- 
cating that they understood what was being 
asked of them. 

On the basis of their findings, McDermott 
and colleagues conclude that preference for 
harmonically related tones is not an innate 
feature of the human auditory nervous system, 
but is instead determined by exposure to music 
that contains harmonic relationships. Such a 
hypothesis provides much food for thought, but 
can be interpreted in several ways. For instance, 
one remarkable — and unusual — feature of 
Tsimane’ music is that only one line is played 
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Figure 1 | A Tsimane’ man makes music. McDermott et al.’ report that a Bolivian tribe of people called 
the Tsimane’ show no preference for consonance over dissonance. This might reflect their musical culture. 


or sung at a time (Fig. 1). Despite repeated 
attempts to encourage Tsimane’ musicians to 
play or sing together, they demurred. It there- 
fore seems likely that the lack of preference 
among the Tsimane’ for harmonically related 
tones is linked to the fact that simultaneous 
tones are irrelevant in their musical culture. 

What is perhaps most surprising is the general 
lack of preference among the Tsimane’ for 
any one combination of tones over any other. 
Although music that contains simultaneous 
dissonant lines is relatively rare across cul- 
tures’, dissonance is often used to produce 
sonic effects or textures’, to create tension 
followed by resolution, or sometimes just 
for its own sake (for example, in contempo- 
rary jazz). Thus, rather than mapping onto a 
simple pleasantness dimension, dissonance 
and consonance may be better thought of as 
ways to manipulate sound expressively, and 
thereby engender emotions or moods. Again, 
the absence of multiple lines in Tsimane’ music 
probably explains why the Tsimane’ do not 
express a preference either way. 

Speech may provide a useful parallel. 
When certain sound contrasts are absent 


from a given language (for example, the dis- 
tinction between ‘r and ‘T sounds in Japa- 
nese), speakers lose the ability to distinguish 
between these sounds after a certain age’. The 
phenomenon represents tuning of the audi- 
tory system to the needs of the environment. 
Could something similar be happening here? 
We do not know how well the Tsimane’ can 
distinguish consonant from dissonant tone 
combinations. The authors did show that the 
Tsimane’ could determine (albeit with some 
difficulty) whether one or two tones were 
present on the basis of mistuning of the har- 
monics; but that does not quite answer the 
question. Lack of exposure to simultaneous 
tones may shape the auditory system such 
that the interactions between harmonics that 
influence preference in Western listeners are 
not processed in the same way, or perhaps 
simply not attended to. 

Research into brain plasticity shows that 
neuronal responses to frequency and other fea- 
tures depend on the sounds that an individual 
is exposed to early in life®. Sensitivity to particu- 
lar features may therefore change depending 
on the musical environment. Plasticity is also 
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relevant in current models of musical pleasure, 
in which cultural influences and experiences 
shape the brains cortical circuitry, which in turn 
influences the brain’s reward system’. 

This explanation is consistent with a strong 
role for environmental effects, in line with the 
authors’ interpretation of their data. But does 
this idea rule out the possibility of innate fac- 
tors? Not necessarily, because, despite their lack 
of preference for harmonically related tones, the 
Tsimane did display a similar dislike to Western 
listeners for roughness — the sensation of sound 
that is elicited by tones that are close together in 
frequency, especially minor and major seconds. 
This finding suggests that there are probably 
some innate biological constraints on which 
environmental input operates. Indeed, audi- 
tory cortical neurons in macaques show differ- 
ential responses to minor and major seconds 
compared with other intervals*; it would be 
hard to argue that this effect is mediated by the 
monkeys’ musical culture. 

The findings also raise the question of how 
or whether harmonic preferences arise when 
a culture develops the use of simultaneous 
sounds. Although the Western harmonic sys- 
tem is definitely not the only way that multi- 
ple sounds can be organized, its principles are 
not arbitrary, and allow for the complex 
relationships known as tonality. Moreover, 
melodic (sequential) frequency relation- 
ships can also be based in part on harmonic 
relationships, perhaps owing to exposure to 
the harmonics of voiced speech’. Tsimane’ 
songs do use musical scales, as do most cul- 
tures’. McDermott et al. also mention that the 
Tsimane’ use major seconds, minor thirds and 
perfect fourths, all of which are based on sim- 
ple integer ratios, and which are also prevalent 
in Western music. So the influence of harmonic 
relations might be expressed in melody, if not 
in harmony. Simple integer ratios are also prev- 
alent in rhythm across cultures* — it would 
be interesting to see whether this feature is 
preserved in Tsimane’ music. 

Although many questions remain, this work 
represents an important contribution to our 
understanding of how the diversity of human 
cultural expression can influence perception. 
More generally, it provides clues to how the 
environment interacts with the nervous system 
to produce all manner of complex behaviours, 
feelings and thoughts. m 
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Unexpected fix for 
ocean models 


Computational models persistently underestimate strong currents that 
redistribute ocean heat. This problem is solved in models in which ocean eddies 
are damped by coupling of the atmosphere with the sea. SEE LETTER P.533 


KATHRYN A. KELLY & LUANNE THOMPSON 


trapped at the western boundary as a con- 

sequence of Earth’s rotation. The currents 
in the Northern Hemisphere carry warm 
subtropical water north along the coast to 
latitudes of about 35-40°N, then turn east to 
form boundary-current extensions. Much of 
the heat carried by these currents is released 
to the atmosphere’, and the remainder pro- 
vides heat to the ocean at higher latitudes. The 
eastward currents simulated by ocean models 
are persistently weaker than those observed. 
On page 533, Ma et al.” show how coupling 
the ocean to the atmosphere in models can 
strengthen the eastward currents, which 
should in turn improve climate predictions. 

In simulations, a weak eastward current 
causes ocean-temperature biases, so that 
water near the coast is warmer than in the 
real world and water in the ocean interior is 
too cold. A weak current will also have a weak 
front (the temperature difference across the 
current is small). This effect has implications 
for the atmosphere, because observational 
analysis* suggests that strong fronts anchor 
mid-latitude storm tracks, whereas weak 
fronts do not. Modelling* that couples the 
ocean to the atmosphere at high atmospheric 
resolution (approximately 50 kilometres) has 
demonstrated the crucial role of ocean fronts 
in driving the atmosphere both in the bound- 
ary layer — the region in which flow is directly 
affected by Earth's surface — and deep into the 
troposphere, the lowest part of the atmosphere, 
where most weather phenomena occur. Ma et 
al. focus instead on how those atmospheric 
modifications feed back to the ocean. 

Weak currents in simulations can be partly 
fixed by increasing the spatial resolution of 
ocean models from approximately 100 km to 
about 10 km. The higher resolution increases 
the ocean’s kinetic energy in the regions near 
the currents to values comparable to esti- 
mates based on observations*. However, the 


| n all ocean basins, strong currents are 


distribution of energy in the ocean-only mod- 
els differs from that observed: in observations, 
more energy is contained in the core of the cur- 
rent extensions than in the eddies that it sheds, 
whereas the reverse is true for the models’. 

Coupling an ocean model to an atmospheric 
model would seem likely to further weaken cur- 
rent extensions, rather than to strengthen them, 
by cooling the warm region south of the cur- 
rent and warming the cold region north of the 
current. Surprisingly, Ma et al. show that such 
coupling can instead strengthen the current in 
high-resolution models. 

Focusing on the Kuroshio Extension, a 
boundary-current extension in the North 
Pacific Ocean, the authors compare coupled 
models that resolve ocean-eddy-scale (about 
10 km) air-sea heat exchange with models that 
do not (Fig. 1). They show that ocean eddies 
drain energy from the current extensions, and 
that coupling of an eddy-resolving atmosphere 
model weakens the eddies by removing heat 
from warm ocean eddies and injecting heat 
into cold ones. An implication of this study is 
that the stronger current simulated by coupled 
models will deliver heat away from the western 
boundary into the ocean interior, as observed, 
and that this will improve climate prediction. 

There are some limitations and further 
important implications of Ma and co-workers’ 
work. The effect of coupling to a high-resolution 
atmospheric model on western boundary cur- 
rents was demonstrated only for the Kuroshio 
Extension region. Although western boundary 
currents all have much in common, such cou- 
pling will not necessarily improve simulations 
inall basins. Moreover, the improvement comes 
at a steep price in terms of computer resources 
and time. The successful result also implies that 
ocean-only models will continue to simulate 
weak currents, because they lack the atmos- 
phere-ocean feedback that lowers the energy 
of ocean eddies*”. 

Accurate simulation of the transport and 
uptake of ocean heat and of atmosphere-ocean 
heat exchange is crucial for climate prediction. 
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Figure 1 | Improved modelling of a strong ocean current. Ma et al.’ 
modelled the Kuroshio Extension, a strong, eastward-flowing current in 
the North Pacific Ocean, using two different versions of a computational 
model. a, In the first version, small-scale (10 km) interactions between 
the ocean and the atmosphere were suppressed by having the atmosphere 
interact with only a ‘smoothed’ version of the ocean. A plot of the 
simulated kinetic energy of the Kuroshio Extension (shown in units 


Most models for century-scale and decadal pre- 
diction in the next assessment of the Intergov- 
ernmental Panel on Climate Change are being 
run at coarse resolution (100 km in the ocean); 
none will be at sufficiently high resolution to 
resolve eddies. Ma and colleagues have isolated 
a specific process that is missing in the lower- 
resolution climate models, and which leads 
to poor representation of ocean currents and 
therefore unrealistic air-sea heat exchange. 

Boundary currents make a large contribution 
to heat transport by the ocean from the tropics 
to the high latitudes, which contributes to mid- 
latitude climate variations, storm-track steering 
and Arctic ice melt*. Accurate representation of 
these strong currents, such as the Gulf Stream, is 
thought to be necessary for climate prediction 
both in the near term (1-20 years)” and on the 
century scale. Ma and co-workers’ study pro- 
vides a strong and specific argument for increas- 
ing the spatial resolution of climate models to 
improve these predictions. m 
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of square metres per seconds squared) suggests a relatively weak, 
meandering current. Land is shown in grey. b, When the authors 

used a version that allows small-scale ocean-atmosphere interactions, 
the Kuroshio Extension is stronger and meanders less than in a, and 
has shifted southward, more closely resembling the observed current. 
These findings help to explain why ocean-only models persistently 
underestimate the strength of such currents. 
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Mitochondrial DNA in 
evolution and disease 


Cellular organelles called mitochondria contain their own DNA. The discovery 
that variation in mitochondrial DNA alters physiology and lifespan in mice has 
implications for evolutionary biology and the origins of disease. SEE LETTER P.561 


DOUGLAS C. WALLACE 


he maternally inherited DNA found 

in cytoplasmic organelles called 

mitochondria encodes the central 
proteins involved in energy production — 
the main function of this organelle. Yet it 
has been assumed that the extraordinar- 
ily high sequence variability of mitochon- 
drial DNA is of little consequence. On 
page 561, Latorre-Pellicer et al.’ dispel this 
erroneous notion. 

The authors transferred mitochondrial 
DNA (mtDNA) from a mouse strain called 
NZB to the nuclear DNA (nDNA) back- 
ground of another strain, C57BL/6, and then 
compared C57BL/6 mice that harboured 
NZB or C57BL/6 mtDNA. The two mtDNA 
sequences differ in genetic variants that confer 


12 amino-acid substitutions and 12 changes 
in RNA molecules involved in mitochondrial 
protein synthesis. Comparison of the mice 
throughout their lives revealed huge differ- 
ences in mitochondrial function, insulin sig- 
nalling, obesity and longevity. This and related 
studies”” clearly demonstrate that naturally 
occurring mtDNA variation is not neutral, and 
that the interaction between mtDNA sequence 
variants and nDNA can have profound effects 
on mammalian biology. 

Why should this be of general interest? It 
turns out that the amount of variation between 
NZB and C57BL/6 mtDNAs is about the 
same as that between two unrelated human 
mtDNAs, so mtDNA variation and its effect 
on nDNA gene expression is also relevant 
to people. 

Mouse and human mtDNA sequences can 
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evolve only by sequentially accumulating 
mutations along radiating maternal lineages. 
For humans, functional mutations arose as 
women migrated out of Africa to colonize 
the rest of the world, modifying their cellular 
energy metabolism and allowing our ances- 
tors to adapt to new regional environmental 
challenges. The mtDNA types (haplotypes) 
that acquired these environmentally advan- 
tageous mutations became widespread in 
their respective environments to give rise to 
regional groups of related haplotypes, called 
haplogroups. This regional selection explains 
why, of all the mtDNA lineages that evolved in 
Africa over the first 100,000 years of human 
history, only two mtDNAs (dubbed M and N) 
successfully left Africa 65,000 years ago to 
colonize the rest of the world. It also explains 
why only N mtDNAs colonized Europe, 
whereas both M and N colonized Asia, and 
why only five mtDNAs colonized the Americas 
(reviewed in ref. 4). Because human mtDNA 
diversity evolved from a single mtDNA, func- 
tional mtDNA variation that allows regional 
population isolation may also contribute to 
speciation’. Hence, mtDNA haplogroups 
are fundamental to the biology of both mice 
and men. 

Because mitochondria have a bioenergetic 
role, it makes sense that mtDNA variation 
affects our physiology and our ability to adapt 
to environmental change. Variation in mtDNA 
genes can permit accommodation to new diets 
or adjustment to thermal stress and activity 
demands, and can even alter the regulation of 
cell death*. The correlation between human or 


mouse mtDNA variation and a broad range of 
traits, including longevity, physical capacity 
and, in humans, predisposition to a wide spec- 
trum of metabolic and degenerative diseases 
and forms of cancer*’, confirms the functional 
importance of mtDNA variation. 

Nuclear gene expression is also affected 
by mtDNA variation, owing to the role of 
the mitochondrial energy-production sys- 
tem in modulating the levels of high-energy 
molecules generated through mitochondrial 
metabolism. High-energy mitochondrial 
metabolic products, such as the molecules 
ATP, acetylCoA and a-ketoglutarate, drive 
the modification of cytoplasmic signalling 
proteins and also add molecular modifica- 
tions to nuclear proteins, which, together 
with nDNA modifications, constitute the 
epigenome. Changes in cellular signalling 
and the epigenome regulate nDNA gene 
expression. This coupling between the mito- 
chondrion and the nucleus is crucial because 
no cellular function can proceed without 
sufficient energy. The nucleus must ‘know’ 
that mitochondria can generate the required 
energy before proceeding with DNA replica- 
tion and transcription, for example. 

Studying cells that have the same nucleus 
but different levels of a pathogenic mtDNA 
mutation — a change in an RNA molecule in 
which nucleotide 3243 is guanine (3243G) 
instead of the normal adenine — has helped 
to define the nature of mtDNA-nDNA 
interactions. Each cell contains hundreds of 
mtDNA copies, so both mutant and normal 
mtDNAs can be present in the cell in different 
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Figure 1 | Mitochondria as the central environmental sensor. Latorre-Pellicer et al.' report that the 
transfer of mitochondrial DNA (mtDNA) from one mouse strain to another has pronounced effects 

on biology, demonstrating that mitochondrial genetic variation is not neutral and that mitochondrial- 
nuclear interactions are of central importance to mammalian physiology. Mitochondrial function 

is directly influenced by environmental changes, so the mitochondrion must have a central role in 
mediating between environmental perturbations and genomic responses. High-energy molecules 
produced by mitochondria modify the cytoplasmic signalling proteins and ‘epigenomic proteins that 
regulate nuclear DNA (nDNA) expression. These changes reprogram gene expression, altering expression 
of nDNA- and mtDNA- derived proteins that act in and on the mitochondria — these alterations feed 
back on mitochondrial function. If energetic homeostasis can be re-established, health and longevity are 
preserved. However, if genetic or environmental changes are too extreme, energy production declines, 


leading to disease and even death. 
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proportions, a state known as heteroplasmy. 
When the 3243G mutation is present at a fre- 
quency of 10-30%, patients can manifest dia- 
betes or, in rare cases, autism; at 50-90%, the 
mutation manifests as neurological, heart and 
muscle problems; and at 100%, it can result 
in childhood disease and death. A study° of 
nuclear gene expression in cell lines contain- 
ing different percentages of the 3243G muta- 
tion revealed that each of these clinical classes 
of heteroplasmy is associated with a distinct 
nuclear gene-expression profile. In humans, 
then, as in Latorre-Pellicer and colleagues’ 
mice, subtle changes in mitochondrial func- 
tion resulting from mtDNA variation can have 
profound effects on nuclear gene expression, 
cellular physiology and individual health. 

Uniparental inheritance of mtDNA is 
almost universal among animals, and mtDNA 
lineages are functionally different, so it might 
be predicted that artificially mixing two differ- 
ent mtDNA haplotypes in the same cell could 
be deleterious. Consistent with this prediction, 
mixing NZB mtDNAs with mtDNAs from a 
strain called 129 (whose mtDNA is similar 
to that of C57BL/6) in C57BL/6 mice results 
in mice that are hypo-active, hyper-excitable 
and have severe long-term-memory defects’. 
Hence, biparental inheritance of different 
mtDNAs can be deleterious, and the NZB 
and 129-C57BL/6 mtDNAs are functionally 
different. 

Because mtDNA haplogroup variation has 
been shaped by environmental selection, it 
follows that mitochondria might be key sen- 
sors of environmental changes. Thanks to our 
growing understanding of the many regulatory 
roles of mitochondria, a holistic picture of cell 
biology has started to emerge (Fig. 1). In this 
scenario, changes in the environment would 
affect mitochondrial bioenergetics and alter 
the production of high-energy mitochondrial 
molecules. Altered concentrations of these 
mitochondrial molecules drive the chemi- 
cal modification of cytoplasmic-signalling 
and epigenomic proteins, reprogramming 
nDNA gene expression. The altered nDNA 
gene-expression status then feeds back to 
modify mitochondrial gene expression and 
re-establish energetic homeostasis and health. 
However, if the local environmental changes 
are too severe to be managed by an indi- 
vidual’s mtDNA-haplogroup-defined physi- 
ological state, or if recent deleterious mtDNA 
mutations are too pathogenic, this feedback 
homeostasis system may fail, resulting in 
progressive energetic decline, disease and 
ultimately death. 

Latorre-Pellicer and colleagues’ study 
provides direct evidence that naturally occur- 
ring mtDNA variation is fundamental to an 
individual’s characteristics and health. This 
information supports a model whereby mito- 
chondrial physiological states determined 
by mtDNA variation sense environmental 
perturbations and send the appropriate 
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signals to the nucleus to produce the optimal 
gene-expression response. ™ 
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Dynamo theory 


questioned 


Observations of X-ray emission — a diagnostic tool for the mechanisms driving 
stellar magnetic fields — from four cool stars call into question accepted models 
of magnetic- field generation in the Sun and stars. SEE LETTER P.526 


PAUL CHARBONNEAU 


very star, including the Sun, hosts a 
Bones field. One of the most nota- 

ble products of stellar magnetic activ- 
ity is X-ray emission (Fig. 1), most of which 
comes from active regions — areas on a star’s 
surface where the magnetic field concentrates 
and whose best-known examples are sunspot 
groups. The commonly accepted theory of 
stellar magnetic-field generation is based ona 
dynamo process of electromag- 
netic induction: the mechanical 
energy of internal and surface 
plasma flows is converted into 
magnetic energy. According to 
this theory, less than one-tenth 
of 1% of the Sun’s total luminos- 
ity is sufficient to drive the solar 
magnetic cycle, heat the corona, 
accelerate the solar wind and 
power all the eruptive phenom- 
ena that collectively make up 
solar activity. 

But on page 526, Wright and 
Drake’ show that the details of 
the dynamo theory still escape 
us. They report on the X-ray 
emissions from four stars 
detected by the Chandra and 
ROSAT space observatories. 
These stars are substantially 
colder than the Sun and have 
a different internal struc- 
ture. If the chain of physical 
mechanisms that lead to X-ray 
emission in the Sun conformed 
to the dynamo theory, we would 
expect a different pattern of 
X-ray emission from these stars. 
Yet, Wright and Drake show that 
their X-ray emissions have the 
same pattern as the Sun’s. 
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The solar-surface magnetic field can be 
observed in great detail because of the Sun’s 
proximity to Earth. Such observations have 
revealed several spatiotemporal patterns 
that are driven by the dynamo. Perhaps most 
notably, polarity reversals occur with a regu- 
lar 11-year period, and magnetically active 
regions are found to emerge ever closer to the 
solar equator as activity cycles unfold’. Con- 
versely, observations of the activity of other 
stars are usually restricted to surface-averaged, 


Figure 1 | Short-wavelength emission from the Sun. In this composite 

image, extreme ultraviolet (red-yellow) was measured by the Solar Dynamics 
Observatory; low-energy X-rays (green) were measured by the Hinode spacecraft; 
and high-energy X-rays (blue) were measured by the Nuclear Spectroscopic 
Telescope Array. Wright and Drake’ present observations of X-ray emissions 

from four stars that cannot be explained by currently accepted models of the solar 
dynamo. 
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global measurements that lack spatial 
information. 

But, where stars are concerned, what 
researchers lack in detail is made up for in 
numbers. Surveys of X-ray emissions from 
large samples of stars — which, among other 
things, determine the dependence of the 
emissions on stellar mass, luminosity and 
rotation rate — provide information on global 
stellar activity and, in turn, insight into the 
underlying magnetic processes. 

In Sun-like stars, X-ray emission increases 
with rotation rate up to a value of a few times 
the rotation period, and then levels off’. 
Information about the details of the internal 
dynamo is thus lost for fast-rotating stars, 
which fall within this plateau region of emis- 
sion behaviour. Wright and Drake present an 
updated version of the emission-behaviour 
profile (see Fig. 1 of the paper’), based on data 
for hundreds of stars of varying masses and 
luminosities. 

Within the solar dynamo theory, a key 
element of the complex causal chain that 
links the internal dynamo to surface X-ray 
emission is the tachocline, 
a transition region between 
the radiative core of the Sun 
and its convective outer layer. 
Solar plasma in the convec- 
tive zone rotates at different 
rates, depending on latitude, 
whereas the radiative core 
rotates more or less as a solid 
body. This difference in rota- 
tional rate between the two 
zones produces a strong 
‘shear’ stress in the tachocline, 
which helps to concentrate 
the diffuse magnetic field into 
structures called flux ropes. 
These magnetic flux ropes 
emerge at the surface and 
generate active regions’. 

Stars progressively less 
massive than the Sun have 
deeper convective envelopes, 
becoming fully convective at 
about 40% of the Sun’s mass. 
Such stars no longer have a 
tachocline, so we should expect 
some qualitative change in their 
mode of dynamo action. This 
long-sought ‘dynamo bound- 
ary’ has not yet been detected. 
Fully convective stars have been 
found to emit the same level 
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of X-rays as solar-type stars that harbour a 
tachocline. However, all previously reported 
stars of this type are fast rotators, and therefore 
fall on the plateau region of the diagram that 
plots the relationship between X-ray emission 
and rotation rate (see the red circles in Fig. 1 
of the paper’), meaning that the details of the 
internal dynamo action might well be lost. 

Enter Wright and Drake. The authors have 
uncovered the X-ray emissions from four 
slowly rotating, fully convective stars, and 
all four fall on the slope part of the X-ray 
emission-behaviour profile. X-ray emissions 
therefore scale identically with the Rossby 
number (the ratio of convective flow speed 
to rotation rate) in stars with and without 
a tachocline.The authors thus argue that the 
tachocline cannot be an essential ingredient for 
stellar dynamo action, as it is in the currently 
accepted theory. 


MICROBIOLOGY 


There are several possible ways to explain 
this quandary. Perhaps low-mass stars are not 
fully convective all the way down to their cen- 
tres. Or maybe the pattern of stellar X-ray emis- 
sion is dominated by a re-organization of the 
magnetic field that occurs in the stellar upper 
atmosphere’, thus losing its ‘memory’ of the 
magnetic field’s dynamo origin. And, of course, 
a tachocline might indeed be non-essential. 

Numerical simulations®” of global solar 
convection have demonstrated that solar-like 
large-scale magnetic fields undergoing regu- 
lar polarity reversals can be produced wholly 
within a convection zone, without the need 
to extend the simulation down to the depth 
of the tachocline. Some of these simulations 
even generate rope-like structures of mag- 
netic flux that rise to the top of the simulation 
domain in a solar-like manner*’. Wright and 
Drake's results, together with such simulations, 


Antibiotics right 
under our nose 


Bacteria that are normally resident in the body have many roles in supporting 
health. Researchers have now identified a bacterial resident of the nose that 
produces an antibiotic that is active against a pathogen. SEE ARTICLE P.511 


KIM LEWIS & PHILIP STRANDWITZ 


here is immense clinical concern about 
the rise of antibiotic-resistant ‘super- 
bugs’ — such as strains of the bacterium 
Staphylococcus aureus known as methicillin- 
resistant S. aureus (MRSA) that have devel- 
oped resistance to several key antibiotics. 
Faced with the growth of resistant strains of 
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Figure 1 | An approach to identifying natural antibiotics. Zipperer et 
al.' used co-occurrence analysis of the human microbiota (the bacterial 
community in the body) to identify bacteria that do not co-occur with a 
target pathogenic bacterium. These candidate competitor bacterial strains 
were tested individually for the ability to inhibit growth of the pathogen, 


bacteria, finding more antibiotics is an urgent 
necessity. Most antibiotics have been isolated 
from soil-living bacteria, but on page 511, Zip- 
perer et al.'identify an antibiotic produced by 
a bacterial resident of the human nose that is 
active against strains of MRSA. 
Staphylococcus aureus is an opportunistic 
human pathogen and, in addition to causing 
MRSA infections, is responsible for many 
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provide an impetus to rethink what we know 
about the solar and stellar dynamo. = 
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infections, including those of the bloodstream 
(bacteraemia) and the lining of the heart 
(endocarditis), or infections surrounding pros- 
thetic implants. S. aureus is found in the noses 
of around 30% of the population. This has 
prompted efforts to decolonize the nose with 
various antibiotics” — a controversial prac- 
tice, considering efforts to reduce antibiotic 
use. But how is it that 70% of the population 
resist colonization by S. aureus? 

The presence of several nasal bacteria, 
including other Staphylococcus species, is 
negatively correlated with the presence of 
S. aureus. But the mechanism that underlies 
this distribution pattern — and the presumed 
inhibition of S. aureus invasion (known as col- 
onization resistance) — has been a mystery’. 
To investigate this potential bacterial anta- 
gonism, Zipperer and colleagues screened 
90 Staphylococcus samples from the human 
nasal cavity, including many Staphylococcus 
species, to test for S. aureus growth inhibition. 

One strain, Staphylococcus lugdunensis, 


and those that did were then cultured and screened to see whether they 
conferred colonization resistance in an animal model. An antibiotic 
compound responsible for pathogen inhibition was isolated by the authors, 
and the compound, or the microbes that produce it, might be developed as 
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cleared a growing population of S. aureus, 
suggesting that the strain secretes a com- 
pound that causes the breakdown (lysis) of 
S. aureus. Using a library of S. lugdunensis 
mutants in which individual genes had been 
knocked out, the researchers identified a 
single mutant that was unable to inhibit S. 
aureus. Studying this mutant enabled iden- 
tification of the compound responsible 
for S. aureus inhibition — a peptide anti- 
biotic that the authors named lugdunin. 
Zipperer and colleagues’ findings suggest 
that treatment with S. ugdunensis (or use of 
lugdunin) may be a valuable tool for the pre- 
vention of S. aureus colonization in the clinic. 

It may seem surprising that a member of 
the human microbiota — the community of 
bacteria that inhabits the body — produces an 
antibiotic. However, the microbiota is com- 
posed of more than a thousand species’, many 
of which compete for space and nutrients, and 
the selective pressure to eliminate bacterial 
neighbours is high. This intense competition 
can be observed by surveying patterns of bac- 
terial communities in which the presence ofa 
particular species is negatively associated with 
the presence of other species’. 

One probable mechanism for these negative 
interactions is the production of antibiotics. 
Although conventional antibiotic discovery 
has focused on mining compounds from 
soil-living bacteria, genome-wide analysis of 
the human microbiome has identified many 
gene clusters that encode enzymes associated 
with antibiotic production, such as polyketide 
synthases or non-ribosomal peptide synth- 
etases. However, only a few such antibiotics 
have been characterized in the laboratory: 
strain-specific bacterial inhibitory molecules 
called bacteriocins”; fairly large peptide anti- 
biotics called lantibiotics*; and the antibiotic 
lactocillin, which is active against several 
pathogenic bacteria’. Until now, evidence 
linking these compounds to the competi- 
tive advantage that they may confer has been 
lacking. The current study fills this gap. 

Zipperer and colleagues found that lugdunin 
is active against several pathogens, includ- 
ing MRSA strains and Enterococcus bacteria 
that are resistant to the antibiotic vancomy- 
cin — and the bacteria treated with lugdunin 
did not develop resistance to it. In both 
in vitro and in vivo studies in mice, the ability 
of S. lugdunensis to outcompete S. aureus 
depends on the presence of a functional 
lugdunin biosynthesis pathway. 

When the authors analysed 187 hospital- 
ized patients, they found that S. aureus colo- 
nization was just 5.9% in individuals who 
carry S. lugdunensis, compared with 34.7% in 
individuals without S. lugdunensis. When pro- 
duced by a bacterium that occupies a confined 
niche, it is clear that lugdunin and similar anti- 
biotics have considerable power to influence 
bacterial community structure. Given that 
S. lugdunensis is present in only around 10% of 
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the population and S. aureus is found in about 
30% of the population, there are probably more 
antibiotics yet to be discovered that are respon- 
sible for S. aureus colonization resistance. 
Lugdunin was found to be effective as 
a topical agent (applied to the surface) for 
treating an S. aureus skin infection in mice. 
Considering that lugdunin can inhibit the syn- 
thesis of major biopolymers (proteins, DNA 
and peptidoglycans) in S. aureus, it is probably 
a membrane-acting antibiotic, which would be 
challenging to develop into a systemic thera- 
peutic because such compounds also tend to 
disrupt membranes of mammalian cells. 

The method of discovery of lugdunin 
reported by Zipperer and colleagues provides 
a general approach to investigating antibiotic- 
driven colonization resistance of the human 
microbiota against pathogens (Fig. 1). By 
combining genomic and co-occurrence data, 
bacteria can be identified that are negatively 
correlated with the presence of a pathogen and 
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that have the potential to produce antibiotics. 
These organisms, or the antibiotics they pro- 
duce, might serve as drug-discovery leads. = 
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Thirty-five years of 
endless cell potential 


Pluripotent cells have the potential to differentiate into any cell type in the body. 
Their isolation and propagation from mouse embryos was pivotal for advances in 
understanding human development and disease. 


M. AZIM SURANI 


he human body consists of around 

250 differentiated cell types, which 

all originate from about 15 cells ina 
structure called the inner cell mass that arises 
at the blastocyst stage of early embryonic 
development. These cells can each give rise 
to every cell type, but this pluripotent status 
is transient, with the cells soon differentiating 
into diverse types. This year marks the 35th 
anniversary of landmark studies by biologists 
Martin Evans and Matthew Kaufman!, and 
Gail Martin’, who independently captured 
and propagated these elusive pluripotent 
cells in a Petri dish for the first time, nam- 
ing them embryonic stem (ES) cells. Studies 
of isolated ES cells have revolutionized our 
understanding of mammalian development, 
and enabled the simulation of human diseases 
in vitro (Fig. 1). 

In the 1950s, it was reported’ that mice, like 
humans, could develop testicular tumours 
called teratomas, which were derived from 
germ-cell precursors (precursors of eggs or, 
in this case, sperm) and contained various dif- 
ferentiated cell types not normally found in 
testes. The teratomas also contained 


embryonic carcinoma (EC) cells, which had 
key attributes of pluripotency. 

However, EC cells could not efficiently 
differentiate into normal tissues if they were 
injected into blastocysts. In particular, their 
differentiation into sperm or eggs was not 
observed, meaning that EC cells could not be 
used to generate strains of mice that could pass 
experimentally induced mutations on to their 
offspring — a major goal of research into pluri- 
potency. This shortcoming encouraged efforts 
to find other sources of pluripotent stem cells. 

Evans and Kaufman established ES cells 
directly from mouse blastocysts that had been 
prevented from undergoing further develop- 
ment, for example by treating the mothers with 
steroids. Meanwhile, Martin grew inner cell 
masses from late-stage blastocysts using cul- 
ture media in which EC cells had been grown 
previously, in the hope that the EC cells might 
have released factors conducive to the growth of 
ES cells. Both studies were successful, leading to 
the first isolations of blastocyst-derived ES cells 
that could be maintained indefinitely in vitro. 

ES-cell isolation had an immediate and 
profound impact on the study of mamma- 
lian development. Unlike EC cells, ES cells 
could reliably give rise to germ cells’, 
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Figure 1 | Exploiting the potential of embryonic stem cells. In 1981, 
Evans and Kaufman’ and Martin’ independently isolated embryonic stem 
(ES) cells, which can give rise to all cell types, from the inner cell masses of 
mouse embryos at the blastocyst stage of development. During the 1980s, 
ES cells carrying experimentally introduced mutations were put back into 
blastocysts — some of the mutated cells developed into sperm and eggs, 
enabling the production of mutant mouse strains. In the late 1990s, human 


making it possible to introduce specific genetic 
mutations into mouse strains. To briefly 
outline the process, ES cells are genetically 
engineered to harbour a specific mutation, and 
are then transplanted into a host blastocyst. 
The resulting embryo is a chimaera — some 
of its cells will contain the mutation, others will 
not. If germ cells carry the mutation, it can be 
passed to the offspring. The consequences of 
the mutation can then be examined in detail. 

A great many mutant mouse strains have 
been generated in this way, enabling detailed 
analyses of the functions of individual genes 
under physiological conditions. Genes vital 
for embryonic development, especially those 
that establish the body plan, were mutated to 
study their precise role’. Moreover, human 
diseases could be modelled in other mammals. 
For example, a gene that causes cystic fibrosis 
in humans was mutated in mouse ES cells®, and 
mice carrying this mutant gene were then cre- 
ated to provide an animal model for the disease. 

The development of large-scale mutation 
strategies such as gene traps (in which muta- 
tions are introduced into DNA at random 
through the insertion ofa genetic construct that 
producesa ‘reporter’ protein) led to the produc- 
tion of mutation libraries — a useful resource 
of cells from which mice carrying particular 
mutations could be made. Before the isolation 
of ES cells, all such research was limited by the 
availability of naturally occurring mutations. 
But today, any gene can be investigated at will. 

Following this impressive progress, attention 
shifted to humans. In 1998, human ES (hES) 
cells were eventually isolated’. Although 
not molecularly identical to mouse ES cells, 
hES cells nonetheless possess the key prop- 
erties of pluripotency — self-renewal and an 
ability to differentiate into diverse lineages. 
Moreover, they provide opportunities for 
studying early human development. 

Ethical considerations prevent hES cells 
from being used for germline therapy, in which 
genetic changes would be introduced into hES- 
derived germ cells with the aim of correcting 
faulty genes. However, two studies published 
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last year®*” showed that germ-cell precursors 
can be derived from hES cells, paving the way 
for studies of human germ cells and indicat- 
ing that it might one day be possible to make 
sperm and eggs in culture. Some key genes 
involved in the induction of human germ-cell 
precursors, however, are not involved in mouse 
germ cells*’. Thus, these studies, with many 
others, have revealed key differences between 
mouse and human development. 


i Studies on ES cells 
: ES -cell also defined the four 
isolation had an key regulatory genes 
immediate and that control pluri- 
profound impact potency, which in 
on the study of 2006 led to a major 
mammalian advance — the con- 
development.” version of differ- 


entiated cells into 

induced pluripotent stem (iPS) cells'®’. 
iPS cells have since been derived from patients 
and differentiated into specific cell types, to 
simulate the initiation and progression of 
disease in culture. Moreover, iPS cells are cur- 
rently being used to develop in vitro models 
for testing potential therapeutic compounds 
to treat diseases such as Alzheimer’s. Human 
ES cells and iPS cells might some day be used 
as part of cell-replacement therapies in dis- 
eased tissues, although the greatest advances 
in their use will initially probably be in basic 
research and in screens for therapeutic agents. 
One useful feature of ES cells is their ability 
to self-organize into differentiated tissues and 
to assemble into structures that have recogniz- 
able attributes of organs. In the past decade, 
this quality has been exploited to produce 
synthetic organs called organoids in culture. 
So far, human organoids have been generated 
that model the brain, intestine, liver and kid- 
ney, among others'*"’*. More-advanced 3D 
models that approximate physiological set- 
tings are likely to be developed in the future 
using tissue-engineering methods. Attempts 
are now under way to generate human organs 
in animals, by incorporating hES cells into 
blastocysts from other species. This might 
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ES cells were first isolated. In the 2000s, study of ES cells from both species 
led to the direct reprogramming of adult cells into induced pluripotent 
stem (iPS) cells. iPS cells have since been used to simulate the progression 
of human diseases in culture. Since 2008, iPS and ES cells have also been 
used to generate synthetic in vitro structures called organoids, which share 
many of the properties of organs in vivo. In the future, ES cells may be used 
to grow human organs in host animals. 


alleviate the unmet demand for organs for 
transplantation. 

Advances made following the isolation of 
pluripotent ES cells have coincided with pro- 
gress in other technologies, such as live imag- 
ing, single-cell analysis and genome editing”. 
When combined to study human organoids, 
these approaches will allow analysis of the 
outcomes of increasingly sophisticated 
manipulations of genes and of the genetic 
elements that regulate their expression. Such 
studies will provide unprecedented under- 
standing of how genes individually and 
collectively regulate human development, and 
how such regulation can go awry. 

The history of ES cells shows how basic 
research has led to revolutionary advances. 
The future presents us with endless opportu- 
nities to explore how a single cell transforms 
into a multicellular adult organism. m= 
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Rapid signalling in distinct dopaminergic 
axons during locomotion and reward 


M. W. Howe! & D. A. Dombeck! 


Dopaminergic projection axons from the midbrain to the striatum are crucial for motor control, as their degeneration 
in Parkinson disease results in profound movement deficits. Paradoxically, most recording methods report rapid phasic 
dopamine signalling (~100-ms bursts) in response to unpredicted rewards, with little evidence for movement-related 
signalling. The leading model posits that phasic signalling in striatum-targeting dopamine neurons drives reward -based 
learning, whereas slow variations in firing (tens of seconds to minutes) in these same neurons bias animals towards 
or away from movement. However, current methods have provided little evidence to support or refute this model. 
Here, using new optical recording methods, we report the discovery of rapid phasic signalling in striatum -targeting 
dopaminergic axons that is associated with, and capable of triggering, locomotion in mice. Axons expressing these 
signals were largely distinct from those that responded to unexpected rewards. These results suggest that dopaminergic 
neuromodulation can differentially impact motor control and reward learning with sub-second precision, and indicate 
that both precise signal timing and neuronal subtype are important parameters to consider in the treatment of dopamine- 


related disorders. 


Dopaminergic projections from the substantia nigra pars compacta 
(SNc) to the striatum are vital for motor control, and their degenera- 
tion in neurodegenerative disorders such as Parkinson disease results 
in severe motor problems?-?. However, little evidence for movement- 
related dopamine signalling has been observed; instead, most recording 
methods report that unpredicted rewards induce phasic signalling 
(hundreds-of-millisecond bursts), which are superimposed on slowly 
varying background activity*”'°. The leading model to explain the 
dual role of dopamine in reward and movement posits that phasic 
bursts in striatum-targeting dopamine neurons act to drive reward- 
based learning, while slow variations in firing (tens of seconds to 
minutes) in these same neurons bias the system towards or away from 
movement!!-!>, However, there is little evidence for or against this 
model. 

Most information about in vivo dopamine dynamics comes from 
extracellular single unit recordings in the ventral tegmental area (VTA) 
and SNc**!6-2! (Supplementary Discussion). This approach provides 
functional characterization of individual neurons with single-spike 
temporal resolution, but it cannot discriminate neurons by their 
projection targets. Moreover, midbrain single unit recordings alone 
cannot reliably discriminate dopaminergic from non-dopaminergic 
neurons®” and are blind to the local control exerted on dopamine 
terminals in the target regions***. Fast-scan cyclic voltammetry, on 
the other hand, measures changes in dopamine release directly in striatal 
terminal regions, but these measurements of extra-synaptic dopamine, 
averaged over hundreds of micrometres, are insensitive to synaptically 
confined dopamine and potentially heterogeneous signalling across 
different axons”°. To measure both phasic (hundreds of millisecond) 
and slowly varying (tens of seconds to minutes) dopamine signalling 
in the striatum during reward delivery and animal movement, and 
to determine whether dopamine projections to the striatum broadcast 
homogenous or heterogeneous signals, we used two-photon calcium 
imaging and fibre photometry to establish the striatal dopamine signalling 
patterns occurring in locomoting mice receiving unpredicted rewards. We 
then tested the role that these activity patterns play in motor control by 
optogenetically stimulating striatum-targeting dopamine axons. 


Locomotion related dopamine signalling 

A new approach was established to image the activity of dopamine 
projection axons in dorsal striatum”® (Fig. la-d). DAT-cre mice 
(n=6) were injected in the midbrain with adenovirus AAV 1-Syn-flex- 
GCaMP6f?’, leading to specific expression of GCaMPéf in dopamine 
neurons widespread throughout SNc and VTA (Fig. 1d). A chronic 
imaging window was then implanted over dorsal striatum (Extended 
Data Fig. 1b) to enable two-photon imaging”® (Methods, Extended Data 
Fig. 2i) of the GCaMP6f-expressing dopaminergic projection axons 
(Fig. 1c). Axons were clearly visible, but were densely packed (Fig. 1c, 
Extended Data Fig. 1a), presumably representing overlapping axonal 
arbors from numerous different dopamine neurons”®. Mice were then 
head-restrained over a cylindrical treadmill, and time-series videos of 
the labelled axons were acquired (in the dark) during rest, self-initiated 
locomotion, and/or delivery of unexpected rewards (Fig. le). 

Calcium influx into axons and terminals, caused by axonal action 
potentials”?° (with local modulation possible’>™), triggers neurotrans- 
mitter release*!~>4, and therefore, changes in GCaMP6f fluorescence?’ 
provide an indirect measure of dopamine release in dorsal striatum**. 
These GCaMPé6f fluorescence changes are well-suited for deter- 
mining whether mouse locomotion (100s of millisecond timescale 
accelerations) is associated with rapid phasic dopamine signalling 
(sub-second-duration GCaMP6f transients), slowly varying dopamine 
signalling (tens of seconds to minutes-duration transients or changes 
in baseline) or both (Methods, Extended Data Fig. 3a-c). 

As a first metric to quantify dopamine signalling in dorsal striatum, 
we summed the fluorescence from all dopamine axons in each 
frame of each imaging field (60-250 1m diameter fields sampled at 
28 Hz) to generate a mean fluorescence activity trace (mean change 
in fluorescence, AF/F trace; Fig. le, bottom). These traces revealed 
frequent transients during treadmill locomotion (Fig. le), and these 
varied in duration and amplitude but generally occurred on sub-second 
timescales (duration 678 + 460 ms (mean +s.d.); amplitude 126 + 67% 
AF/F; Extended Data Figs 2g, h, 3e). Longer duration transients often 
contained multiple peaks, as expected from summation of closely 
spaced, short duration transients?” (Fig. le, Extended Data Fig. 3). 
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Figure 1 | Locomotion related signalling in dorsal-striatal-projecting 
dopamine axons. a, Schematic of methods. b, Mouse running on 
treadmill under microscope. c, Representative field of GCaMP6f- 
labelled dopamine axons in dorsal striatum. DAT, dopamine transporter. 
d, GCampé6f expression in dopamine neurons (green) overlaid with 
tyrosine hydroxylase (TH) immunofluorescence (red). Right insets 

are at high magnification. e, Average whole-field AF/F fluorescence 
from a representative field in one mouse (bottom, n = 6 mice total) and 
corresponding treadmill velocity (top) and acceleration (middle) during 
locomotion. f, Comparison of mean baseline (periods with no significant 
calcium transients) whole-field AF/F between locomotion and resting 
periods. g, Comparison of mean whole-field fluorescence change 

(mean of significant calcium transients, excluded baseline periods) 
between locomotion and resting periods. f, g, Each point represents 
mean AF/F for running or resting over one session for each field (n = 17 
sessions from 6 mice, lines connect same field/session, short lines: mean 
over sessions); **P < 107°, Wilcoxon rank-sum test. NS, not significant 
(P>0.05). 


Transients were typically synchronous across the densely innervated 
fields, presumably reflecting widespread co-activation of many 
projection axons (Extended Data Fig. 1), and were more frequent 
during spontaneous locomotion than resting (non-reward) periods 
(Fig. 1g, Extended Data Fig. 4h; P= 1.8 x 1074, Wilcoxon sign-rank test 
across all fields, n =6 mice). However, no significant difference in the 
fluorescence baseline (periods with no significant calcium transients) 
was found between locomotion and resting (Fig. 1f; P=0.45, Wilcoxon 
sign-rank test across all fields, n =6 mice). Moreover, the duration of 
nearly all significant dopamine transients (n = 2,087) was far less than 
the duration of the average locomotion bout (mean locomotion bout 
10.3 + 11.2, Extended Data Fig. 3e). Thus in dorsal striatum, treadmill 
locomotion is associated with widespread and synchronous sub-second 
transient increases in GCaMPé6f fluorescence in dopamine axons, 
suggestive of phasic dopamine release, but is not associated with 
detectable long-duration transients or sustained changes in 
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baseline fluorescence levels, suggesting little slowly-varying dopamine 
release. 


Dopamine signalling to acceleration bursts 

We first examined whether the widespread fluorescence transients 
in dorsal striatum displayed a timing preference with respect to 
locomotion initiations on the treadmill (sharp rest-run transitions; 
Fig. 2a, d, f, Extended Data Fig. 4a, b). On average and at indi- 
vidual onsets, GCaMP@6f transients increased rapidly just before 
(~100-150 ms) acceleration onsets occurring at locomotion initia- 
tion (Fig. 2a, Extended Data Fig. 4a, j, 1, 0; cross-correlation peak at 
—143 £250 ms, significantly different from 0 lag, P=0.003, Wilcoxon 
sign-rank test across peak correlation times, Fig. 2d, f). Thus, signalling 
across a widespread population of dopaminergic axons leads locomo- 
tion initiation in dorsal striatum, suggesting that sub-second dopamine 
signalling has a role in initiating locomotion bouts. 

We then examined whether the widespread fluorescence 
transients displayed a timing preference with respect to discrete 
phases of continuous mouse locomotion movements. Movements 
consisted of rhythmic acceleration bursts on the treadmill over a 
broad frequency range centred on ~2.5 Hz (Extended Data Fig. 4e). 
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Figure 2 | Phasic dopamine signalling displays a sub-second timing 
preference with respect to acceleration bursts. a, Top, mean whole-field 
AF/F triggered on accelerations (accel) at locomotion initiations; each row 
is mean for each field/session (AF/F normalized for each row and sorted 
by peak time). Bottom, mean acceleration (black) and whole-field AF/F 
(green) all triggered on locomotion initiations (mean across all fields). 

b, Same as a, except triggered on accelerations during continuous 
locomotion periods. c, Same as a, except triggered on locomotion 
terminations; note lack of AF/F transient peak (arrow) after final 
acceleration. d, Correlations between acceleration and whole-field 
fluorescence at different relative time-lags (that is, cross-correlations) 
during locomotion initiation periods (sorted by peak correlation time); 
each row is mean for each field/session. Corr coeff, correlation coefficient. 
e, Same as d, but during continuous locomotion periods. f, Mean cross- 
correlations between acceleration and whole-field fluorescence for 
initiation (blue) and continuous (red) locomotion periods. g, Mean 
acceleration (black) triggered on all short duration calcium transients 
(green, mean of transients) during continuous locomotion across all fields. 
Shaded regions in a-c, f, g denotes mean + s.e.m.; n = 6 mice in ag. 
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Indeed, fluorescence transients displayed a delayed timing preference 
with respect to acceleration bursts during continuous locomotion 
(Fig. 2b, e, f, Extended Data Fig. 4c, d), where on average they 
followed the onsets (zero crossing) of individual acceleration bursts 
(Fig. 2b,g; cross-correlation peak at 215 + 90 ms, significantly different 
from 0 lag, P= 1.66 x 10-4, Wilcoxon sign-rank test across 18 fields; 
Fig. 2e, f, Extended Data Fig. 5a, b; n= 6 mice), an effect that was also 
clearly observed with individual isolated transients (Extended Data 
Fig. 4c, k). Importantly, the acceleration-associated calcium transients 
were observed before the mice ever received rewards on the tread- 
mill (Extended Data Fig. 4h, i), and the signalling was highly similar 
after the introduction of unpredicted rewards. Thus, contrary to 
current widely accepted models, these results establish the existence 
of a widespread population of dopaminergic axons projecting to the 
dorsal striatum that display rapid phasic signalling associated with 
ongoing accelerations, which are expressed independently of reward 
expectation. 

We then investigated whether the dopamine signalling during 
continuous locomotion might affect subsequent movements. We 
observed a significant correlation between calcium transient ampli- 
tudes and the amplitudes of the immediately following acceleration 
bursts (Spearman’s Rho = 0.13, P= 0.006, all transient-acceleration 
pairs; binned data, Extended Data Fig. 5d), despite a weaker temporal 
association (compared to previous acceleration bursts, Extended 
Data Fig. 5a, b). Furthermore, just before locomotion termination 
(stopping), calcium transient amplitudes associated with the last 
acceleration burst (before termination) were reduced compared to the 
transient amplitudes expected from similar acceleration bursts dur- 
ing continuous locomotion (Fig. 2c, arrow). Thus, during continuous 
locomotion, dopamine axon transients seem to be more temporally 
associated with the previous (rather than following) acceleration burst 
(Fig. 2b, e, f, g, Extended Data Fig. 5a, b) and may positively impact 
the next acceleration burst (Extended Data Fig. 5e), perhaps acting as a 
feedback signal to promote plasticity or continued locomotion. 


Locomotion triggered by dopamine axon stimulation 

To test whether sub-second timescale signalling in dopamine axons can 
trigger or alter locomotion, channelrhodopsin* (ChR2) was expressed 
in midbrain dopamine neurons of DAT-cre mice (1 =7) and the dopa- 
mine axons in the dorsal striatum were (bi-laterally) optically stimu- 
lated (Fig. 3a, Methods). Optical stimulation consisted of 166 or 83 ms 
duration bursts of 8-ms pulses (pulses repeated at 60 Hz); these bursts 
were then repeated at 3 or 6 Hz, respectively (see Fig. 3b; frequencies 
corresponding to rhythmic accelerations measured during average 
(3 Hz) or fast (6 Hz) locomotion; Extended Data Fig. 4e) to create 
stimulation trains. This was intended to generate rapid phasic dopa- 
mine axon signalling similar to that observed during locomotion. The 
laser stimulation trains reliably trigged rapid transitions from resting 
to locomotion (or occasionally rocking forward and back; Fig. 3c—g; 
statistically significant acceleration increase averaged across all mice 
and sessions, P < 0.01, shuffle test; statistically significant increase in 
acceleration, averaged across all sessions for 6 out of 7 individual mice, 
P<0.01, shuffle test), although not every stimulation led to locomotion 
(Fig. 3d, e). Transitions to locomotion were rapid: for 3 Hz burst trains, 
acceleration significantly increased from rest after the first burst (that 
is, ten 8-ms pulses; significantly increased acceleration compared to 
shuffle ~160 ms after burst onset, P < 0.01; Fig. 3g; similar to timing 
during spontaneous initiations, Fig. 2a, d, f, Extended Data Fig. 41), 
whereas for 6 Hz burst trains, acceleration significantly increased from 
rest after two bursts (that is, two bursts of five 8-ms pulses; signifi- 
cantly increased acceleration compared to shuffle ~240 ms after burst 
onset, P < 0.01; Fig. 3g). Interestingly, 3 or 6 Hz stimulation trains 
that continued after locomotion initiation were capable of shifting 
the acceleration frequency in individual mice (Fig. 3h) towards the 
stimulation frequency during the ensuing locomotion bout. All mice 
(n=7) displayed higher frequency accelerations during locomotion 
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Figure 3 | Pulsed optogenetic stimulation of dorsal-striatum- 
projecting dopamine axons can rapidly initiate locomotion and 
control acceleration frequency. a, Schematic of methods. b, Pulsed 

laser delivery protocol for dopamine axon ChR2 stimulation (used for 

all data presented in this paper). c, Representative acceleration from a 
single stimulation session (blue regions, laser train stimulation to the 
dorsal striatum). d, Absolute value of mouse acceleration aligned on onset 
of all laser stimulation trains (red line) during a representative single 
session from one mouse (each row represents a single trial stimulation 
from rest). e, All mean accelerations aligned on laser stimulation trains 
applied to mice at rest (1 =7 mice total, each row represents mean over 
one session; rows corresponding to sessions for each mouse grouped 
together; sessions from different mice separated by dashed lines). f, Mean 
absolute (abs) value of mouse accelerations aligned on the onsets of laser 
stimulation trains applied to mice at rest (mean across all stimulation 
onsets, n = 161 and 267 onsets for ChR2 and control, respectively, in all 
sessions and mice, 3 and 6 Hz stimulation included). g, Same data as 

f, but stimulations separated into 3-or 6-Hz groups and zoomed-in time 
(blue, laser stimulation). Dashed lines, time when acceleration becomes 
significantly (P < 0.01) greater than random shuffle (thin grey line, mean 
random shuffle). h, Power spectra of acceleration for two representative 
mice during locomotion periods initiated during 3- and 6-Hz stimulations. 
i, j, Centre of mass of acceleration power spectra (COM) for each ChR2 (i) 
or control (j) mouse for locomotion periods initiated during 3-and 6-Hz 
stimulations. Horizontal bars indicate means, lines connect same mouse. 
k, Mean difference between centre of mass of acceleration power spectra 
during 3 and 6-Hz stimulations (differences calculated for each mouse) 
averaged across control or ChR2 mice. 1, Mean velocity across mice 
during 3 and 6-Hz stimulations included in k. Shaded regions in f and g 
denote mean + s.e.m. *P < 0.05; **P< 0.01 (compared to 0), Wilcoxon 
rank-sum test. 


bouts initiated during 6 Hz stimulation than during 3 Hz stimulation 
(Fig. 3i, k, significant shift, P< 0.01, Wilcoxon rank-sum test) despite 
equivalent mean velocities during these periods (Fig. 31). In two mice, 
acceleration power spectra peaks matched the respective stimulation 
frequencies (one of the two mice in Fig. 3h, top) and accelerations 
during locomotion were significantly entrained to the laser stimu- 
lations (P < 0.01, shuffle test; Extended Data Fig. 6a, b; entrainment 
also significant across all mice, P< 0.01 shuffle test, Extended Data 
Fig. 6c, d). Notably, the same optical stimulation applied to control mice 
without ChR2 expression failed to trigger locomotion or shift or entrain 
locomotion frequency (n= 7; Fig. 3f, j, k, Extended Data Fig. 6c, d). 
Thus, optogenetic stimulation of dorsal-striatum-projecting dopa- 
mine axons can rapidly trigger locomotion bouts and shift or entrain 
rhythmic acceleration frequency, demonstrating that signalling in 
dorsal striatal dopamine axons can impact locomotion movements on 
sub-second timescales. 
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Dopamine axon functional heterogeneity 
Unpredicted rewards are well-known to trigger phasic bursts of 
dopamine signalling *'>'®. Surprisingly, we observed little or no pha- 
sic response to randomly delivered water rewards in the mean whole- 
field AF/F traces; instead the signals decreased in accordance with the 
deceleration from locomotion to reward consumption or remained flat 
when reward was delivered at rest (Extended Data Fig. 4f, g). However, 
a small, sparse population of axons were found that signalled during 
reward (Fig. 4a). To characterize reward and locomotion signalling 
at the individual axon level, we then sparsely labelled small popula- 
tions of midbrain dopamine neurons (Fig. 4b, Extended Data Fig. 7), 
resulting in sparse labelling of axons in dorsal striatum. This enabled 
us to resolve the fluorescence changes from putative single axons 
(Fig. 4c, d, Extended Data Fig. 8a, b) and revealed that, as suggested by 
the widespread labelling experiments (Fig. 4a), putative single axons in 
the same dorsal striatum region can differentially signal unpredicted 
reward and locomotion (Fig. 4d-i). Thus, the dopaminergic projection 
to the dorsal striatum is heterogeneous with respect to unpredicted 
reward and locomotion, but is dominated by locomotion signalling. 
We then examined whether these functionally distinct axons emanate 
from different midbrain nuclei**. Injections were localized primarily 
to either SNc (1n=5 mice) or medial VTA (n=5 mice) (Extended 
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Data Fig. 7). In total, 73 isolated putative single axons (Methods) were 
imaged from SNc and 98 from VTA, of which 63 out of 73 (86%) and 
72 out of 98 (73%), respectively, responded to locomotion and/or 
reward events (P < 0.01, Wilcoxon rank-sum test, locomotion versus 
rest; during unpredicted reward delivery, P < 0.01, shuffle test versus 
non-reward periods; or during both). The large majority of the respon- 
sive VTA axons signalled unpredicted reward (Fig. 4k, |, n; 53 out of 
72 axons, 74% reward responsive). These responses were consistent 
with documented single-unit electrophysiology and fast-scan cyclic 
voltammetry**"'°!87; they were absent on omission trials, scaled with 
reward-size, were not present to licking in the absence of reward, and 
were present whether reward was delivered during running or rest 
(Extended Data Fig. 9). In contrast to the VTA, such prominent reward 
signalling was absent from active dopamine axons emanating from SNc 
(Fig. 4k, 1, n, Extended Data Fig. 9; 0 out of 63 axons). Instead, all event 
responsive axons from SNc displayed significantly increased signalling 
during locomotion compared to rest (Fig. 4n; 63 out of 63 axons). This 
signalling was transient rather than sustained (Fig. 4j, m, Extended 
Data Fig. 2g, h) and displayed a timing preference with respect to accel- 
eration similar to the whole-field measurements (Fig. 2a-g), indicating 
that single SNc axons, rather than different subpopulations, can signal 
during locomotion initiation and continuous locomotion (Extended 


Figure 4 | Functional heterogeneity and anatomical origin of dorsal- 
striatum-projecting dopamine axons. a, Mean image during locomotion 
periods minus mean image during reward periods for two representative 
dopamine axon fields (arrows, axon segments signalling more during 
reward). b, Coronal sections showing restricted GCampé6f expression 
(green) in VTA (top) and SNc (bottom) overlaid with tyrosine hydroxylase 
immunofluorescence (red). c, Representative maximum fluorescence 
projection image from one field in a mouse with GCaMP6f expression 
localized to VTA (n=5 total VTA-targeted mice). Gold and purple regions 
are identified regions of interest (ROIs) from putative single axons (grey, 
background). d, AF/F from ROIs in ¢ aligned with treadmill velocity and 
reward time. Bkg, background. e, Reward-triggered AF/F from the purple 
(top) and gold (bottom) ROIs from c; each row represents a single reward 
delivery. f, Mean reward-triggered AF/F (gold, purple, top), velocity 
(black, bottom) and licking (grey, bottom) for the two axons in c over 

the recording session. g, h, Same as e, f, except triggered on locomotion 
onsets. i, Same as a, but for field shown in c. j, Mean AF/F triggered on 
acceleration onsets for all putative single axons from all SNc-injected 

(n=5 mice, 73 axons, green) and VTA-injected (1 =5 mice, 98 axons, 
red) mice. Bottom, mean acceleration. k, Mean AF/F triggered on reward 
for all putative single axons in j from SNc (green) and VTA (red). Bottom, 
mean mouse licking triggered on reward delivery. l, Reward-triggered 
AF/F for all putative VTA (left) and SNc (right) axons (same axons in 

j, k, sorted by reward response magnitude); each row represents mean 
AF/F for each axon. m, Mean AF/F triggered on acceleration onsets 

(top) and rewards (bottom) for all significant (P< 0.01) VTA reward only 
responsive (n = 43 axons from 5 mice, red) and SNc locomotion responsive 
(n= 63 axons from 5 mice, green) axons. n, Reward response versus 
locomotion index for all VTA (n=5 mice, top) and SNc (n=5 mice, 
bottom) putative axons. Green, significant locomotion; red, significant 
reward; blue both significant; neither significant not shown. 0, Mean 
AF/F during locomotion (bottom) and rest (top, non-reward periods 
during rest) periods for all locomotion (n= 19 axons, 4 mice green) 

and reward (n= 43 axons, 5 mice red) responsive VTA axons. 

*P.< 107°, Wilcoxon rank-sum test. Shaded regions in j, k and m denote 
mean + s.e.m. 
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Data Fig. 8c-e). Only a minority of the total responsive VTA (Fig. 4k, 
m-o; 29 out of 72 axons, 40%) and reward signalling VTA (Fig. 4m-o; 
10 out of 53 axons, 19%) axons significantly increased signalling during 
locomotion compared to rest, but similar to the SNc projections, this 
signalling was transient and acceleration-associated. Overall, reward 
and locomotion signalling were negatively correlated in the heteroge- 
neous VTA axon population (Fig. 4n; Rho = —0.51, P=5.7 x 10~°). 
The presence of both reward and locomotion signalling axons from 
the VTA probably reflects some within-region heterogeneity, however, 
some expression spillover into SNc*° or SNc/VTA border region 
may contribute to the locomotion population. Taken together, these 
results demonstrate a previously unknown functional heterogeneity 
within dorsal-striatum-projecting dopamine axons, which, for the 
most part, arise from different anatomical nuclei: sparsely projecting 
reward signalling axons originate primarily from VTA while densely 
projecting SNc originating axons generate an acceleration-associated 
locomotion-signal. 


Dopamine signalling functional topography 

Owing to the prominence of unpredicted reward signalling in pre- 
vious recordings*”*'7!9*7-99, the small fraction of reward signalling 
dopamine axons in dorsal striatum was surprising. However, because 
ventral striatum receives most of its dopaminergic innervation from 
VTA, whereas dorsal striatum receives most from SNc?°, we reasoned 
that locomotion and unpredicted reward signalling might be similarly 
graded across the striatum. Single-photon photometry***? was used 
to record from the GCaMP6f-expressing dopamine axons along 
the dorsal-ventral axis (Fig. 5a; n =5 mice). This method lacked single 
axon resolution, but provided a means of recording throughout the 
striatum with minimal tissue damage. Recordings from dorsal 
striatum revealed acceleration-associated dopamine signalling highly 
similar to that observed with two-photon imaging (Figs le, 2b versus 
Fig. 5b-d, Extended Data Fig. 10a-f; serving as a control for the cortical 
lesion required for two-photon imaging). More ventrally, acceleration- 
associated signalling declined while unpredicted reward signalling 
increased (locomotion index versus depth, Rho =0.7, P=6 x 10~° 
reward signal versus depth: Rho = —0.65, P=5.2 x 1075; Fig. 5c, d, 
Extended Data Fig. 10g) and dominated in ventral striatum (Fig. 5c, d, 
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Figure 5 ence Functional topography of reward and locomotion dopamine 
signalling across striatum dorsal-ventral axis. a, Schematic of 
photometry methods. b, Representative photometry AF/F (green) and 
treadmill accelerations (black) during locomotion and reward delivery 
(dashed lines) periods from dorsal (top) and ventral (bottom) striatum 
from same mouse (same session, n = 5 mice total). c, Representative 
photometry AF/F from individual locomotion initiations (left) and reward 
deliveries (right) measured at indicated depths (not averaged). d, Mean 
AF/F across all photometry recording sessions (n = 5 mice) triggered on 
acceleration onsets (left) and reward deliveries (right) at depths indicated 
in c (black, mean acceleration). 
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Extended Data Fig. 10g). Consistent with this result, pulsed optogenetic 
stimulation of dopamine axons in ventral striatum had comparatively 
little effect on locomotion (Extended Data Fig. 6e-h; n = 4 mice, same 
ChR2-expressing mice as used in Fig. 3). No significant dopamine 
signalling was seen in the overlying cortex (Fig. 5c, d). Thus, a functional 
topography in dopamine signalling exists across the striatal dorsal- 
ventral axis**, with locomotion and reward signalling dominating 
dorsal and ventral striatum, respectively. 


Discussion 

The rapid, phasic, acceleration-associated locomotion signalling of 
dopamine projection axons described here represent a new mode of 
dopaminergic modulation of striatal circuitry and establish strong evi- 
dence for the long-elusive role for dopamine neurons in fast-timescale 
motor-control. This result is perhaps surprising given the little evi- 
dence for movement-related dopamine signalling previously observed; 
this difference may be due to task and/or methodological differences 
(Supplementary Discussion). Our results support new models of dopa- 
minergic signalling that incorporate rapid, sub-second modulation of 
movement by dopamine!®°”! over current models that posit a permis- 
sive involvement by a slowly varying background'*!*!>° (Extended 
Data Fig. 10h-k). The phasic signalling occurred in the absence of 
action requirements or rewards, suggesting a general role in internally 
driven movement control (Extended Data Fig. 4). 

Our optogenetic stimulation experiments support the idea that the 
phasic signalling in dopamine axons observed with optical record- 
ing can trigger and entrain locomotion movements on a sub-second 
timescale, demonstrating the utility of using measurements of natural 
neuronal dynamics"! to define stimulation* parameters for effecting a 
desired behavioural outcome”. Dopamine signalling is part of a com- 
plex circuit regulating voluntary movement, and therefore is unlikely 
to instruct movement alone. The behavioural effects of these signals 
may therefore be strongly modulatory, rather than deterministic, an 
idea supported by the variability in signalling timing and stimulation 
effects on locomotion (Fig. 3d, h-k, Extended Data Fig. 4j, k), although 
this could also be partly owing to the limited region imaged or stim- 
ulated. Dopamine signalling could interact differently with the many 
converging components of the movement network during movement 
initiations versus continuous movements. At initiations (Fig. 2a, d, f, 
Extended Data Fig. 4a, b, j), dopamine may provide the necessary 
motivational drive to execute an intended movement. During ongo- 
ing movements, dopamine signalling lagging behind accelerations 
(Fig. 2b, e, f, g, Extended Data Fig. 4c, d) may modulate upcoming 
movements (Extended Data Fig. 5d, e), determine whether dorsal 
striatum circuitry maintains an existing motor pattern (Fig. 2c), 
or instruct motor learning. Importantly, these findings establish a 
movement signal that is probably compromised in Parkinson disease, 
and suggests the potential therapeutic use of restoring precisely timed, 
movement-associated dorsal striatum dopamine signalling”’. 

Finally, distinct populations of axons signalling to reward or loco- 
motion were observed both locally within dorsal striatum and glob- 
ally across the striatum’s dorsal-ventral axis suggesting regional 
tuning to regulate the specific functions of the terminal regions**””. 
Heterogeneity of midbrain dopamine neurons has been described 
in terms of inputs***’, gene expression***’ and electrophysiological 
properties*”; such cellular heterogeneity may align with the functional 
subpopulations described here (Supplementary Discussion)?" 
Specific targeting of these populations, and the local mechanisms such 
as cholinergic modulation*?™ that shape their signalling, may provide 
novel therapeutic opportunities for neurological disorders. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Virus injections for optical recording. All experiments were approved by the 
Northwestern University Animal Care and Use Committee. Heterozygous adult 
male mice (postnatal 3-4 months) with Cre expression in DAT-containing mid- 
brain dopamine neurons (GENSAT line Tg(Slc6a3-cre)SG62Gsat/Mmucd) were 
anaesthetized with isofluorane (1-2%). For achieving widespread expression 
of GCaMP6f in dopaminergic neurons (n= 6, Figs 1, 2, 4a, 5, Extended Data 
Figs 1, 2k, 3e, 4, 5, 10), a 0.5—1-mm diameter craniotomy was made over the right 
midbrain (—3.4mm caudal, +1.0 lateral from bregma). A small volume (0.04 11) 
of flexed-GCaMP6f virus (AAV 1-Syn-flex-GCaMPé6f, 1.4 x 10'8 genome copies 
(GC) ml"? diluted 1:1 in PBS; University of Pennsylvania vector core) was pressure 
injected through a pulled glass micropipette into the midbrain at two caudal 
locations (~—3.3 and —3.5mm from bregma) and 3 depths at each location 
(—4.1, —4.4 and —4.7 mm ventral from dura surface) for a total volume of 
~0.24 11 For sparse, targeted injections (Fig. 4b-o, Extended Data Figs 2, 3, 7-9), 
the craniotomy was centred over either the medial VTA (n=5; caudal —3.1mm, 
lateral 0.2 mm) or lateral SNe (1 =5; caudal —3.4mm, lateral 1.5mm) and virus 
(0.025-0.05 11) was injected at two ventral locations (—4.3, —4.6mm) for a total 
of ~0.08 1l. Following the injections, the skull and craniotomy were sealed with 
Metabond (Parkell) and a custom metal headplate. 

Dorsal striatum window implant surgery. One week after injection, mice were again 
anaesthetized, and the headplate and metabond removed. A ~2.75 mm craniotomy 
was performed above the dorsal striatum (+-0.5 mm rostral, +1.8-2.1 mm lateral to 
bregma) and cortical tissue was aspirated until the internal capsule fibres overlying 
the striatum were exposed”°. Some of these fibres were then carefully removed 
until only a thin layer remained over the striatal surface. A thin (~100,1m) 
layer of Kwik-Sil (WPI) was applied over the imaging region, and a metal can- 
nula (stainless-steel tube, 2.7 mm outer diameter, 2.3 mm inner diameter, 1.6mm 
length) covered at one end by a glass coverslip”® was inserted into the aspiration 
site down to the fibre surface (Extended Data Fig. 2i). The imaging cannula, along 
with a metal headplate, and positioning ring (see ref. 26 for details) were sealed 
and attached to the skull with Metabond. 

Behaviour and two-photon imaging. After the window surgery, the mice were 
allowed to recover in their home cages for ~2-3 days. After this time, the mice 
began water scheduling (receiving ~0.8-1 ml of water per day) as described previ- 
ously”*°!, The mice displayed no obvious signs of motor deficits following window 
implantation or water scheduling. Then 3-4 weeks after implant surgery, mice 
were head-fixed with their limbs resting on a 1D cylindrical styrofoam treadmill 
~20cm in diameter by 13cm wide (treadmill described previously”, Fig. 1b) in 
the dark, which allowed them to run freely forwards and backwards. Large (1611) 
and small volume (4,11) water rewards were delivered at pseudo-random time 
intervals (28.1 + 19.8s; mean +s.d.) through a water spout gated electronically 
through a solenoid valve once imaging began, and spout licking was measured 
by a contact monitoring circuit. Solenoid valve triggering was accompanied by a 
short ‘click’ noise, which allowed us to conduct reward omission trials in which 
the water tube was manually closed (Extended Data Fig. 9a, b). Rotational velocity 
of the treadmill during locomotion was sampled at 1,000 Hz by a rotary encoder 
(E2-5000, US Digital used for most experiments; a few early experiments used a 
E6B2-CWZ3E, Yumo) attached to the axel of the treadmill. Instantaneous acceler- 
ation was calculated as the difference between consecutive treadmill velocity (first 
smoothed over two bins) measurements. Live video of behaviour on the treadmill 
was recorded at 30 Hz via a table mounted infared CCD camera (Computa) illu- 
minated by an infared light source. 

Two-photon imaging was performed using a custom, table mounted microscope 
system and data acquisition system described in ref. 53 (except the electric lens was 
not used here). Laser (920 nm) average power was in the range of 100-200 mW, but 
typically closer to 100 mW: This power measurement was made after the objective 
(Olympus LUMPlanFL N, 40x, 0.8 NA), but before the cannula, which apertures 
the excitation light, likely further reducing the power reaching the striatum. A 
Digidatal440A (Molecular Devices) data acquisition system was used to record 
(Clampex 10.3) and synchronize reward timing, licking, wheel velocity, and 
two-photon image frame timing. Imaging sessions began after mice were accli- 
mated to head fixation and ran frequently on the treadmill (~1-3 days). To rule out 
the influence of prior reward experience on locomotion responses, a subset of mice 
(n=4) were imaged before receiving any water rewards on the wheel (Extended 
Data Fig. 4h, i). Time-series videos (3,000-12,000 frames) were acquired at 28 Hz 
(256 x 64 pixels, 0.5 ms per line). A subset of the sparsely labelled single axon 
recordings were performed at 14 Hz (256 x 128 pixels, 0.5 ms per line) to achieve 
better spatial resolution for resolving small axonal arbors. For triggered averages 
(Fig. 4j, k, m), AF/F fluorescence traces generated from these 14 Hz imaging data 
sets were linearly interpolated to match the data collected at the 28 Hz sampling 
rates. Field of view sizes ranged from ~60 to 250m. 
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Striatum imaging fields (60-250 1m diameter fields) were selected based on 
the presence of fibre morphology with at least occasional calcium transients in the 
fibres (field were not selected based on the behavioural correlation of the transients, 
only whether transients were clearly present) and, for widespread dopamine neu- 
ron expression experiments (Figs 1, 2, Extended Data Figs 1, 3, 4, 5), densely 
labelled projections across the field, or for sparse dopamine neuron expression 
experiments (Fig. 4, Extended Data Figs 7-9), few labelled projections in the field. 
Fields ranged in depth from 20-100 1m below the fibre surface. Each mouse was 
imaged for ~5-10 days, with multiple fields imaged per day (an ‘imaging sessio’ 
consisted of time-series recording from a single field, typically for many minutes); 
~5-15 total different fields were imaged per mouse across all days (not all sessions 
met behaviour criteria to be included in all analyses, see below); no attempt was 
made to locate the same imaging field/axons from day to day, except in a small 
subset of experiments to demonstrate that axon morphology and behavioural sig- 
nalling remains stable over days (Extended Data Fig. 2). Imaging up to ~700 1m 
from the centre of the cannula was possible, although most fields were acquired 
near the centre of the cannula. Thus, most of our imaging fields were located 
more laterally in the dorsal striatum (~1.5-2.4mm lateral from the midline), but 
some medial fields were obtained as well (most medial, ~1.1 mm from midline, 
Extended Data Fig. 1). Imaging along the rostral-caudal axis ranged from —0.2 to 
1.2mm from bregma. There was no preference for recording location within this 
large range of positions and no obvious differences in dopamine signalling were 
observed within this range. 

Interpretation of mechanisms underlying calcium transients and characteri- 
zation of putative single dopamine axon calcium transients. The time course 
of fluorescence changes caused by changes in cytoplasmic calcium concentration 
are well-known and characterized”””, providing a significant framework within 
which to interpret our observed calcium transients (Extended Data Fig. 3a—-c). For 
example, a short tens of milliseconds timescale influx of calcium (such as a short 
burst of action potentials over tens of milliseconds) results in a stereotyped calcium 
transient consisting of a fluorescence increase to peak over ~50-70 ms followed 
by an exponential decay to baseline over ~150-250ms (Extended Data Fig. 3a). 
Multiple influx events separated by less than the indicator decay time (for example, 
longer bursts of action potential firing over ~100s of milliseconds) lead to longer 
duration, larger amplitude calcium transients due to summation (Extended Data 
Fig. 3b), while a sustained increase in the rate of influx events separated by less than 
the decay time (for example, sustained increase in action potential firing) leads to 
sustained elevations or apparent shifts in the fluorescence baseline due to ongoing 
summation*™ (Extended Data Fig. 3c, left). Thus, GCaMPé6f fluorescence changes 
are well-suited for determining whether mouse locomotion (accelerations on the 
100s of millisecond timescale) is associated with sub-second phasic dopamine 
signalling (rapid transient changes in fluorescence), slowly varying dopamine 
signalling (long-duration transients or changes in the GCaMP6f baseline level 
over many seconds or minutes) or both. However, our imaging methods probably 
cannot reliably detect single action potentials in single axons, or the difference in 
transient amplitude due to an additional spike in a burst. Furthermore, it is pos- 
sible that small, slow changes in action potential firing could occur between rest 
and locomotion and not be detected using our optical recording methods. Also, 
although we did not detect slow changes in signalling here, such changes could 
occur over timescales longer than our typical recording sessions (many minutes). 
Optical stimulation of dopamine axons. To achieve widespread expression 
of ChR2 in dopaminergic axons in the striatum, DAT-cre mice (n= 7; 5 male, 
2 female; postnatal 3-6 months) were injected bilaterally (0.2-0.3 11 total per 
hemisphere) as above with AAV9-EFla-DIO-hChR2 (University of Pennsylvania 
vector core diluted 3:1 in PBS to 3.5 x 10! GC m17!) in the midbrain (—3.3 mm 
caudal, +1.3 mm lateral from bregma) at three depths (—4.1, —4.4, —4.7mm from 
the dura surface). Headplates for head fixation were then attached to the skull 
with a thin layer of Metabond and the location of the striatum (+0.5 mm rostral, 
+1.8 mm lateral from bregma) was marked on the surface. After 4-6 weeks>®, 
mice were anaesthetized with isoflourane and small craniotomies (~0.5-1 mm in 
diameter) were drilled (at the pre-marked locations) through the Metabond and 
skull over the striatum, leaving the dura and cortex intact. The craniotomy was 
then sealed with Kwik-Sil. After the mice recovered from this short (~10-15 min) 
surgery for at least one day, they were head-fixed on the linear treadmill. When 
mice began to exhibit periodic, spontaneous run-rest transitions on the treadmill, 
the Kwik-Sil covering the craniotomies was removed and two optical fibres (200 jm 
diameter, ThorLabs) were lowered slowly into the brain bilaterally (to various 
depths ranging from dorsal (1.6-2.2 mm) to ventral (3.6-4.1 mm) striatum) to 
allow for delivery of laser light (488 nm diode laser, Thor Labs). Laser stimulation 
(5mW laser power at the fibre tip) trains consisted of 83 ms ON/83 ms OFF (6 Hz) 
or 166 ms ON/166 ms OFF (3 Hz) periods and the laser was pulsed (8 ms ON/8 ms 
OFF, using an optical chopper, ThorLabs) during the 83 or 166 ms ON periods. 
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We refer to this stimulation as a train of 8-ms pulse bursts repeated at 3 or 6 Hz. 
For example, for 6-Hz stimulation, the laser stimulation trains consisted of the 
following: 8 ms ON/8 ms OFF/8 ms ON/8 ms OFF (repeated for 83 ms) followed 
by 83 ms with no stimulation and then the whole sequence repeated (Fig. 3b). The 
onset of laser stimulation trains were triggered manually via a pulse generator and 
synchronized to output from the treadmill rotary encoder through our data acqui- 
sition system (see above). Stimulation trains (duration 8.6 + 2.8 s) were initiated 
periodically at pseudo-random intervals primarily during rest periods during 1-8 
stimulation blocks (sessions) per mouse per day at varying depths in the striatum 
(3-5 min block durations, all occurring during a single ~40 min recording period 
on the treadmill per day per mouse). Importantly, bright LED lights (470 nM and 
530nM ThorLabs) provided constant illumination onto the face of the mice on 
the treadmill during the stimulation blocks to mask the mice from the fibre optic 
light delivered to their brain. After stimulation was completed, craniotomies were 
again covered with Kwik-Sil to allow for additional sessions on subsequent days 
(1-3 days per mouse). We note that effects highly similar to those shown in 
Fig. 3 and Extended Data Fig. 6a—d were obtained using the above laser stimula- 
tion trains, but with either faster pulsing (4ms ON/4ms OFF) during the 83 or 
166 ms ON periods or no pulsing at all (only 83 ms ON/83 ms OFF, 6 Hz or 166 ms 
ON/166 ms OFE, 3 Hz); however all data presented in the manuscript were obtained 
using the 8-ms pulsing during bursts protocol. 

Single fibre photometry. For fibre photometry, we coupled the blue diode laser 
excitation light (488 nm diode laser, Thor Labs; filtered using a 488/6 nm filter, 
Semrock) into an optical fibre (200|1m diameter, ThorLabs) and used neutral 
density filters between the laser and fibre coupling lens to adjust power (1 mW at 
fibre tip). Fluorescence was collected through the same fibre and was separated 
from the excitation light by placing a dichroic mirror (505 nm-shortpass, Semrock) 
between the laser and fibre coupling lens. Fluorescence light was further filtered 
(540/50 nm, Semrock) and then measured using a GaAsP PMT (H10770PA-40, 
Hamamatsu). The PMT signal was digitized and recorded using the electronics 
and ScanImage software from our two-photon microscope (described above). 
Synchronization of the recorded fluorescence with treadmill movements was 
accomplished as described above for two-photon imaging. 

DAT-cre mice (n= 5) were injected with viral vectors to obtain ubiquitous 
GCaMPéf expression in midbrain dopamine neurons as described above for 
two-photon imaging experiments and fitted with plates for head-fixation as 
described for the ChR2 experiments. At 3-5 weeks after injection, mice were 
trained for 1-3 days on the linear treadmill with rewards until they ran reliably. 
Mice were then anaesthetized with isoflourane and a unilateral craniotomy 
(~0.5-1 mm in diameter, dura and cortex were left intact) over the central striatum 
(+0.5A/P, +1.3-1.8mm M/L from bregma) was performed. After recovery from 
the short (~10-15 min) surgery, mice were head-fixed on the linear treadmill. 
When mice began to run regularly on the treadmill, the optical fibre was lowered 
slowly into the brain to various depths ranging from Cortex to Ventral Striatum 
for photometry recording (Fig. 5, Extended Data Fig. 10). Unpredicted rewards 
were delivered as described above. 

Data analysis. Data were analysed using ImageJ (1.46) and custom functions writ- 
ten in MATLAB (Version 2012b). Sample sizes were chosen to reliably measure 
experimental parameters while remaining in compliance with ethical guidelines 
to minimize the number of animals used. Experiments did not involve blinding, 
but randomization was used with respect to reward delivery and order of data 
collection from control versus experimental mice. All data in the text and figures 
are labelled as either mean + s.d. or mean +s.e.m. 

Motion correction and ROI selection for two-photon microscopy. Each time- 
series video was motion corrected using algorithms described previously”. 
Videos were then visibly inspected to confirm post-correction image stability and 
videos with excessive motion artefact or apparent z-shifts were excluded from 
analysis. To quantify whole-field fluorescence (Figs 1, 2, Extended Data Figs 1, 
2k, 3e, 4, 5) a single large region of interest (ROI) was hand-selected (to contain 
all the active regions of the field) from the mean fluorescence image for each time 
series. Fluorescence traces and ROIs from putative single axon segments from 
the sparsely labelled fields (Fig. 4, Extended Data Figs 7-9) were generated using 
CellSort?’ with the following parameters: smoothing width = 1.5, mu=0.8, prin- 
cipal components = 200, area limits = 50-3,000 pixels. ROI fluorescence traces 
and ROI morphology were visibly inspected and selected for inclusion as a puta- 
tive axon if they exhibited elongated morphology consistent with axons (Fig. 4c) 
and activity patterns with predominantly positive-going fluorescence changes (no 
structures with slowly varying fluorescence activity patterns were observed). The 
fluorescence traces were then baseline normalized (eighth percentile fluorescence 
over a 1,000 bin, ~35.7s, sliding window, to remove any slow drifts in baseline; 
see below for measurements of slow changes in baseline) and converted to AF/F. 
Significant positive-going transients (used for Figs 1g, 2g, 4k-o; Extended Data 


Figs 2g, h, 3d, e, 4h, i (bottom), 5a, b, 7-9, 10a—c) were identified as previously 
described”*. To account for axon segments that probably originated from the same 
neuron, hierarchical clustering (simple linkage) of the linear correlation matrix 
between AF/F traces of all segments in each field was performed using a cor- 
relation coefficient threshold of 0.8 (results were similar using other clustering 
thresholds, Extended Data Fig. 8a, b), and AF/F traces belonging to the same 
cluster were averaged. To verify single axon separation from background, analysis 
was repeated only on the most sparsely expressing fields; separation of functional 
axon classes was qualitatively similar to Fig. 4n using this more conservative 
approach (data not shown). 
Relationship of fluorescence signals to treadmill locomotion. All mice (n=6 
whole field, n= 5 for each single-axon imaging group, n = 5 for photometry) 
contributed to averaged measures unless otherwise noted in the text or figure 
captions. Individual imaging sessions were excluded if they did not meet criteria 
for each specific behavioural analysis (that is, locomotion initiation, continuous 
locomotion, locomotion terminations; see below for each), hence the session 
numbers varied slightly for each measure. For general comparisons of fluores- 
cence changes in whole fields and single axons during locomotion and rest periods 
(Figs 1f, g, 4n, o, Extended Data Figs 2f, 4h, 7, 8,10a—c, g), we classified each imag- 
ing frame (~35-ms bin size) as occurring during active locomotion, passive rest, or 
neither (uncertain, excluded). All bins occurring within 5.5 s post-reward delivery 
or during a spontaneous licking period were excluded. Locomotion bins were iden- 
tified as those that had a treadmill velocity measure greater than 0.2ms~! and an 
acceleration greater than 1.7ms ? ina 200-ms window on either side. Conversely, 
rest bins were defined as having no accelerations greater than 0.3 ms” or veloci- 
ties greater than 0.05 ms" !. These parameters were determined based upon visual 
comparison of live video with acceleration and velocity traces and are probably 
conservative constraints. To be included for locomotion/rest comparisons, behav- 
iour over the entire imaging session had to include a minimum of 30 rest bins and 
30 locomotion bins. Slow, sustained changes in baseline fluorescence (Figs 1f and 
Extended Data Fig. 10a—c) were quantified by comparing AF/F for locomotion 
and resting periods that did not contain significant calcium transients. The AF/F 
baseline normalization was re-computed for this calculation using a larger sliding 
window (2,000 bins, ~70.4s) to ensure that slow events over multiple locomotion 
bouts (Extended Data Fig. 3e) would be detected if they were present. Further, 
examination of AF/F traces with no baseline normalization did not reveal slow 
events corresponding to locomotion bouts (Extended Data Fig. 2k). 

Locomotion initiation times for triggered averages of fluorescence (Figs 2a, d, 
f, 4h, Extended Data Figs 4a, b, j, 10d-f) were first identified manually by visual 
inspection of velocity traces for each session. Typically, mice exhibited clear tran- 
sitions between rest (periods with few treadmill accelerations) and continuous 
treadmill locomotion. Bursts of acceleration not followed by a prolonged eleva- 
tion in velocity (indicative of continuous locomotion) were not selected. Manually 
selected initiations were included for analyses only if they were followed (within 
10 bins, ~350 ms) by an acceleration peak greater than 1 ms ? and were not 
preceded within 1s by any velocity bins larger than 0.05 ms! or accelerations 
larger than 0.3ms 2. Acceleration-triggered averages were calculated relative to the 
onset of the first acceleration (after the manually defined time point) greater than 
0.3ms~. Locomotion terminations (Fig. 2c) were also selected manually as the 
first deceleration to rest following locomotion bouts. One termination time-point 
was selected for each initiation. Terminations were included if they were preceded 
(within 10 bins) by a maximum velocity greater than 0.1 ms~!, indicating a sharp 
locomotion to rest transition. For defining periods of acceleration occurring during 
continuous running (Fig. 2b, e, f, Extended Data Figs 4c, d, k, m, 8d, e), acceler- 
ations were required to be preceded (within a ~200-ms window) by a minimum 
run velocity of (0.1 ms!) and followed by a mean run velocity (over 200 ms) of 
0.1ms~!. Non behaviour-specific acceleration triggered averages of fluorescence 
(Figs 4j, m, 5d, Extended Data Fig. 4i, top) were calculated for all accelerations 
exceeding 1.7ms ? (that is, accelerations occurring during continuous locomotion, 
locomotion onsets and any other short movements not falling into either of these 
two categories, though the vast majority occurred during continuous locomotion). 

For acceleration triggered averages of AF/F, single average AF/F traces were 
derived for each field (Figs 1 and 2), putative single axon (Fig. 4), or photometry 
recording location (Fig. 5, Extended Data Fig. 10) across all identified accelera- 
tions (meeting acceleration criteria, triggered on the zero crossing of the acceler- 
ation trace when the acceleration was increasing and crossed a threshold value 
>1.7ms *) in an imaging session; and these traces were averaged across sessions 
and mice to generate summary traces. AF/F triggered averages of acceleration 
were calculated analogously (Fig. 2g, Extended Data Fig. 4i, bottom, triggered on 
the onsets of significant, positive-going AF/F transients). For these traces, and for 
analysis of amplitudes of preceding and following accelerations bursts (Extended 
Data Fig. 5c, d) only significant transients <0.5s duration were included to avoid 
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inclusion of any sustained summated transients. Shuffled triggered averages were 
computed relative to an equivalent number of randomly selected time-points 
(randsample in MATLAB). Correlations (Pearson’s) between acceleration and 
AF/F were calculated between the (concatenated) traces around all initiation bins 
(+750 ms around each) or continuous locomotion periods (+2 after locomo- 
tion initiation to locomotion termination). Note that differences in the criteria 
for selecting different behaviour periods (locomotion initiations, terminations, 
continuous running periods, etc) results in the inclusion of a different number of 
fields or axons for each behaviour (that is, Fig. 2a—e). 

Single-photon fibre photometry fluorescence data was first averaged over 
35.7-ms bins and then corrected for background signal (intrinsic fluorescence 
and laser bleed-through). The signal recorded in the unlabelled cortex (estimated 
in a subset of the recordings) was assumed to be background and this signal was 
subtracted from the striatum signal. After this subtraction, the remaining photom- 
etry signal (assumed to be GCaMP6f fluorescence) was analysed identically to the 
whole field two-photon imaging data sets described above. 

Relationship of acceleration to ChR2 axon stimulation. Significant effects of 
laser stimulation on locomotion initiations (Fig. 3d—g) were assessed by comparing 
the mean absolute value of acceleration within a 0.5-2-s window (for each mouse 
and across all mice, Fig. 3f) or for each bin (Fig. 3g) following stimulation train 
onsets that were delivered during rest (no large accelerations within 2s before 
stimulation) across all mice and stimulation trains to the means obtained from 
1,000 rounds of randomly selecting an equivalent number of non-laser stimulation 
rest periods from the same stimulation sessions. Sessions were included if mice 
exhibited spontaneous locomotion/rest transitions in the absence of stimulation 
and exhibited average resting bout durations of >6 (to allow comparison with 
spontaneous transitions from rest to running). Stimulation effects were consid- 
ered significant if the post-stimulation means were greater than 990 of the 1,000 
randomly selected means (P< 0.01). Note that because we analysed laser train 
onsets that were preceded by 2s of rest, there was a slightly higher probability for 
mice to be moving in a window after the stimulation onset simply by chance. This 
explains the apparent small change in the control group (Fig. 3f) at laser train onset 
and in the shuffle group (Fig. 3g). A similar test was applied to assess significant 
frequency dependent acceleration entraiment (Extended Data Fig. 6c, d). Laser 
burst triggered averages of acceleration were computed during locomotion periods 
beginning after laser train onset and the mean of absolute value of these triggered 
averages were compared with 1,000 averages from randomly selected non-laser 
stimulation locomotion periods. Consistent phase relationships between the laser 
stimulation and acceleration produce significant positive and negative peaks in the 
triggered average (Extended Data Fig. 6c,d ), whereas traces with no consistent 
phase relationship result in flat triggered averages (see controls, Extended Data 
Fig. 6c, d). Multi-taper power spectral densities of acceleration (Fig. 3h) were con- 
structed for all locomotion periods (minimum 3s of locomotion per stimulation 
train period) during 6 Hz and 3 Hz stimulation periods in each block using the 
mtspectrumc function in the Chronux library (http://chronux.org/) with 5 tapers 
and a time bandwidth product of 3 over the interval 1 Hz to 8 Hz. Power spectra 
across blocks were averaged to produce mean power spectra for each mouse (one 
each for 6 and 3 Hz stimulations, Fig. 3h). The centre of mass (Fig. 3i-l, Extended 
Data Fig. 6f-h) was calculated from the top 25% of the mean power spectra for 
each mouse (although highly similar results were obtained using the top 50%, 75% 
or all of the power spectra). 

Classification of functional signalling related to reward and locomotion 
across putative single axons and striatum locations. Axon heterogeneity in 
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single fields relative to reward and locomotion periods (Fig. 4a, i) was visualized 
by computing the mean fluorescence image during locomotion bins (see above) 
and subtracting the mean fluorescence image during reward delivery (a 750-ms 
window following reward delivery). Locomotion index (Fig. 4n and Extended Data 
Figs 7, 8, 10g) was the difference between the mean AF/F during locomotion and rest 
(A/AFmove — A/AF,est) and axons were classified as significantly locomotion 
responsive via a Wilcoxon rank-sum test (P< 0.05) between all locomotion and 
rest AF/F values. Reward responses (Fig. 4n and Extended Data Figs 7, 8, 10g) were 
defined as the mean AF/F in a 300-ms window after reward delivery. Significance 
of reward responses were computed with a bootstrap test (P< 0.05) where the 
P value was the probability of receiving an equivalent or larger mean value from 
1,000 rounds of randomly selecting a trigger bin from all non-reward bins. 

ROC analysis. ROC (receiver operating characteristic) curves (Extended Data 
Fig. 4n, 0) to assess the ability of calcium transients to discriminate locomotion 
from rest were computed using the ‘perfcurve function in MATLAB for all bins 
for each imaging field that were classified as locomotion or rest based on the cri- 
teria defined above. The positive state was defined as locomotion, the negative as 
rest (Extended Data Fig. 4n). For the onset ROC analysis to assess ability of cal- 
cium transients to discriminate pre-locomotor from other rest periods (Extended 
Data Fig. 40), all resting bins were classified as either being pre-onset (positive 
state, within 250 ms before defined onset; see above) or non-pre onset (negative 
state). 

Histology. Mice with injections targeting SNc or VTA (see above) were perfused 
trans-cardially with 15ml PBS (Fischer) and 15 ml of 4% paraformaldehyde 
(EMS). Brains were stored in PBS at 4°C then transferred to 40% sucrose (Fischer) 
overnight before sectioning. Coronal slices (40-50 1m) were cut on a freezing 
microtome and stored at 4°C in PBS. For immunostaining of dopaminergic 
neurons, sections from a subset of mice were blocked in 5% serum, incubated 
overnight at 4°C with antibodies for tyrosine hydroxylase (1:700 dilution, Millipore 
AB152, Fig. 1d, Extended Data Fig. 7), then incubated with secondary antibodies 
tagged with Alexa Fluor 594 (Jackson). Large area imaging of tyrosine hydroxylase 
and GCaMP6f expression was performed on an Olympus Slide Scanner (VS120) 
microscope, and high magnification imaging of co-labelling (Fig. 1d, right) was 
performed with a confocal microscope. 
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Extended Data Figure 1 | Synchronized dopamine projection axon 
dynamics across a single field in dorsal striatum. a, Representative mean 
fluorescence image in dorsal striatum of a dense field of dopamine axons 
(compare to the sparse fields from sparse labelling in Fig. 4c, i) from one 
mouse (out of six) labelled with GCaMP6f. b, Coronal schematic showing 
approximate location and scale of region imaged at top. Arrow indicates 
the approximate range of medial/lateral positions used for two-photon 
imaging (see Methods). c, AF/F traces for the rectangular ROIs indicated 


ROI number 


ina. d, Correlation matrix for the ROIs indicated in a. Note high degree 
of transient co-activation across ROIs. e, Mean image of fluorescence 
during the transient indicated by the arrow in c minus mean image of 
fluorescence during non-transient periods for the field shown in a. Note 
that the morphology of active regions closely resembles the morphology 
of GCaMP6f-expressing axons in the whole field in a, indicating 
synchronous activation of large, dense regions of axons, likely belonging 
to several different parent neurons. 
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Extended Data Figure 2 | Further characterization of two-photon 
imaging and analysis methods. a, b, Top, example mean fluorescence 
images of a putative single SNc axon imaged over 2 consecutive days in 
one mouse. c, d, Mean images of fluorescence during locomotion periods 
minus mean image of fluorescence during reward periods for fields in 

a and b. White axonal regions indicate regions of elevated signalling 
during locomotion. Note the similar morphology and behaviour signalling 
of the identified axon (red arrow) over days. e, Acceleration (black) and 
AF/F (green) for the identified axon in a-d across the two imaging days. 
Note the similar transient amplitudes and the elevated transient signalling 
during locomotion acceleration periods. f, Mean transient AF/F (mean 

of significant transients, excluding baseline periods) during locomotion 
and rest on days 1 and 2 for the axon shown in a-d. g, Histograms of 
calcium transient duration times across all putative single axons imaged 
in dorsal striatum from SNc (n= 3,556 transients, 5 mice top) and VTA 


1000 


(n=5140 transients, 5 mice bottom). Note the similar duration profile 
across the two populations (medians not significantly different, P > 0.05 
Wilcoxon sign-rank test). h, Histograms of maximum calcium transient 
amplitudes across all putative single axons imaged in dorsal striatum from 
SNc (top) and VTA (bottom) (medians not significantly different, P > 0.05 
Wilcoxon sign-rank test). i, Post-mortem image of a coronal section from 
a representative mouse showing the striatum imaging cannula window 
cortical lesion site. j, Post-mortem image from a different mouse than 

i that was used for fibre photometry recording (fibre track indicated 

by arrow). k, Similarity of raw (no baseline normalization) whole-field 
GCaMPéf F trace (top) with baseline normalized AF/F trace (bottom) for 
the example whole-field imaging session shown in Fig. le. In the top trace, 
note the lack of baseline change over the recording session and particularly 
the stability of the baseline level during locomotion periods. Bottom trace 
is duplicate of trace in Fig. le. 
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Extended Data Figure 3 | See next page for caption. 
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Extended Data Figure 3 | Interpretation of mechanisms underlying 
calcium transients and characterization of putative single dopamine 
axon calcium transients. a, Left, mock trace representing expected 
GCaMPéf calcium transient from a short millisecond timescale influx 

of calcium (arrows; for example, short burst of action potentials over 

tens of milliseconds; local modulation may also contribute to calcium 
influx). Right, three representative low-amplitude, short duration calcium 
transients (from putative single dopamine axons in dorsal striatum) 

(see Methods, Fig. 4) that display onset and decay kinetics consistent with 
mock transient (left). b, Left, mock trace representing expected GCaMP6f 
calcium transient from multiple calcium influx events separated by less 
than the indicator decay time (arrows; for example, longer burst of action 
potential firing over ~100s of milliseconds). Right, representative larger 
amplitude calcium transients (from putative single dopamine axons in 


dorsal striatum) with rapid rise times consistent with mock transient (left). 


c, Left, mock trace representing expected GCaMP6f calcium transient 
from a sustained increase in the rate of influx events separated by less 
than the decay time (for example, sustained increase in action potential 
firing). Right, representative trace of one of the longest duration calcium 
transients observed (from putative single dopamine axon in dorsal 
striatum). Note that no sustained increases (baseline shifts) similar to the 
mock trace (left) were observed in single axon recordings or whole-field 


ARTICLE 


AF/F measurements; however, the long duration transient shown (right) 
indicates that if such sustained increases had occurred, they would have 
been detected using our methods. Also, note that the mock traces shown 
in a-c are for descriptive purposes and are not based on new data. These 
traces are based on two main assumptions: (1) AF/F is a monotonically 
increasing function of intracellular calcium concentration, which itself is 
a monotonically increasing function of the number of underlying action 
potentials (that is, a greater number of action potentials leads to a larger 
AF/F, but the relationship is not necessarily linear); and (2) AF/F 
transients summate (not necessarily linearly) when they overlap in time. 
d, Duration versus peak AF/F for all identified significant calcium 
transients in putative single SNc originating axons (see Methods, 

Fig. 4, n = 3,556 transients from 73 axons in 5 mice; Spearman’s Rho = 0.3 
P<10~1). e, Histogram of sustained locomotion period durations (from 
SNc injected mice, n = 5, top) and calcium transient durations for all 
putative single SNc axons (nm =5 mice, mid) and all whole-fields (n= 6 
mice, bottom). Note that the median calcium transient duration (for 
either single axon or whole-field) is far less than the median locomotion 
duration, indicating that the increase in dopamine axon GCaMP6f 
AF/F observed during locomotion is due to an increase in relatively 
short duration calcium transients, rather than long-duration (sustained) 
increases in AF/F. 
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Extended Data Figure 4 | See next page for caption. 
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Extended Data Figure 4 | Further characterization of acceleration- 
associated dopamine signalling. a, Representative whole-field AF/F 
fluorescence trace (one field from one out of six mice, green) aligned to 
treadmill acceleration (black) during a locomotion onset (dashed line: 
onset). b, Video frames of mouse for time points shown in a. c, d, Same 

as a and b but for a period of continuous locomotion. e, Top, normalized 
spectral power of treadmill acceleration trace during continuous 
locomotion periods for each two-photon imaging session (each row 
represents a session, m = 6 mice). Bottom, normalized mean power from 
all sessions shown in e, top. f, Top, mean whole-field AF/F trace triggered 
on reward delivery time for all fields; each row is mean for each field/ 
session (n= 22 fields, 6 mice). Bottom, mean treadmill velocity (black), 
mean whole-field AF/F (green), and spout licking (light blue) all triggered 
on reward time (mean across all 22 fields/sessions in 6 mice). g, Mean 
whole-field AF/F (top), velocity (mid), and acceleration (bottom) for trials 
in which reward was delivered mid-locomotion (n= 13 sessions, 

4 mice, red) or when animals were at rest (1 = 12 sessions, 4 mice, blue). 
Note the sharp decrease in AF/F relative to baseline when animals 
decelerated from locomotion to consume the reward and the relative 
absence of phasic reward signalling when animals were given reward 
from rest. Reward responses were also not observed in single SNc axons 
when animals received reward from rest (see Extended Data Fig. 9). 

h, Comparison of mean whole-field fluorescence change from significant 
calcium transients (excluding baseline periods) between locomotion 

and resting periods; each point represents mean AF/F for running or 
resting over one session for each field (lines connect same field/session). 
All fields included here were imaged before mice ever receiving any 
rewards on the treadmill (n= 14 fields, 4 mice). *P < 107° (Wilcoxon 
rank-sum test). i, top, Mean acceleration (black) and whole-field AF/F 
(green) triggered on acceleration onsets (mean across all fields) during 
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continuous locomotion. Bottom, mean acceleration (black) triggered on 
all short duration calcium transients (green, mean of transients) during 
continuous locomotion across all fields. All fields included here were 
imaged before mice ever receiving any rewards on the treadmill (n = 14 
fields, 4 mice). j, Top, whole-field AF/F from all locomotion initiations in 
a representative single session (single imaging field, single session, one out 
of 6 mice); each row represents a single locomotion initiation time period 
(sorted by peak AF/F time). Mid, treadmill accelerations corresponding 
to locomotion initiations shown in j, top. Bottom, average of acceleration 
(black) and AF/F (green) across all locomotion onset traces displayed in 
Top and Mid panels. k, same as j, but for continuous locomotion periods. 
1, Reproduction of Fig. 2a with zoomed-in time axes to show the timing of 
the mean DF/F in relation to the first acceleration at locomotion initiations 
from rest. Shaded red region indicates bins that were significantly 

(*P < 0.01, Wilcoxon sign-rank, n= 15 fields in 6 mice) elevated relative 
to rest baseline. Shaded region covers ~107 ms before acceleration onset. 
m, Reproduction of Fig. 2b with zoomed-in time axes to show the timing 
of the mean dopamine transient in relation to the accelerations during 
continuous locomotion (n= 18 fields, 6 mice). n, ROC curves for each 
two-photon whole-field AF/F trace (n = 22 fields from 6 mice, grey; mean, 
black line) assessed for ability to discriminate locomotion versus resting 
periods (21 out of 22 exhibited significant discriminability, P< 0.01). Area 
under the curve (AUC) = 0.76 + 0.02 (mean +s.e.m.). 0, ROC curves for 
each two-photon whole-field AF/F trace (n= 17 fields in 6 mice, grey; 
mean, black line) assessed for ability to discriminate pre-locomotion onset 
rest periods (250 ms before onset) from other rest periods (10 out of 17 
exhibited significant discriminability, P < 0.01, two sessions included 

did not meet onset criteria for Fig. 2a, Methods). AUC = 0.58 + 0.02 
(mean +s.e.m.). Dashed red lines indicate the line of no discrimination. 
Shaded regions in f, g, i, 1, m denotes mean +s.e.m. 
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Extended Data Figure 5 | Dopamine axon calcium transients are 
temporally associated with preceding acceleration bursts and 

their amplitude is correlated with both preceding and subsequent 
acceleration bursts. a, b, Distribution of latencies from each significant 
calcium transient onset (mean whole-field fluorescence; 6 mice) to the 
first acceleration burst onset within 1s preceding (n = 1,087, 6 mice, a) 

or following (n = 990, 6 mice, b) during continuous locomotion. Latencies 
are less variable (F-test for difference between variance of latencies, 
P=7.1 x 10-*) and shorter (Wilcoxon test for difference between latency 
means, P= 1.2 x 107°) to the preceding acceleration onsets, indicating 
more precise relative timing between the GCampéf transients and the 
preceding acceleration burst versus the following acceleration burst. 

c, d, Mean acceleration traces from the first acceleration (within 1 s) 
preceding (c) or following (d) all short duration (<0.5s) large amplitude 
(>75th percentile, n = 149 transients, grey) and small amplitude 

(<25th percentile, n = 149 transients, bronze) calcium transients 
occurring during continuous locomotion; aligned on acceleration onsets. 


Insets are schematics of the GCaMP@6f transients. A significant correlation 
is present between the transient amplitudes and the immediately preceding 
acceleration amplitudes (Spearman’s Rho = 0.16, P=1.2 x 10~“, from all 
transient-acceleration pairs; binned data from this plot shown in c). 

A significant correlation is also present between the transient amplitudes 
and the immediately following acceleration amplitudes (Spearman's 

Rho = 0.13, P=0.006, from all transient-acceleration pairs; binned 

data from this plot shown in d). e, Schematic summarizing relationship 
between the timing and amplitude of dopamine axon calcium transients 
and acceleration bursts during continuous locomotion. f, Mean 
acceleration (black) and whole-field AF/F (green) all triggered on all 
accelerations during continuous locomotion that were less than 1.7ms * 
in amplitude (n = 596 accelerations, n = 6 mice); this demonstrates 

that dopamine axon GCaMPéf signalling displays a timing preference 
with respect to small amplitude accelerations, with similar timing and 
amplitude to that shown in Fig. 2b (which includes both large and small 
amplitude accelerations). 
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Extended Data Figure 6 | Pulsed optogenetic stimulation of dorsal- 
striatum-projecting and ventral-striatum-projecting dopamine axons. 
a-d, Pulsed optogenetic stimulation of dorsal-striatum-projecting 
dopamine axons can entrain accelerations during locomotion. e-h, Pulsed 
optogenetic stimulation of ventral-striatum-projecting dopamine axons 
leads to little effect on locomotion. a, b, Representative acceleration 
traces from continuous locomotion periods during (and initiated by) 

6 Hz (a) and 3 Hz (b) laser stimulation trains in the same mouse. Blue, 
laser stimulation train, one mouse out of seven. c, d, Mean accelerations 
triggered on individual laser burst onsets during continuous locomotion 
periods for 6 Hz (c) and 3 Hz (d) across all laser bursts in all mice and 
sessions (n =7 mice). e, Mean absolute value of mouse accelerations 
aligned on onset of laser stimulation train applied to mice at rest (mean 
across all stimulation onsets, n =55 and 91 for ventral and dorsal 
respectively, in all sessions and mice, 3 and 6 Hz stimulation included). 
Dorsal and ventral striatum stimulations are from same ChR2-expressing 
mice (n= 4). Three mice were not stimulated in ventral striatum and 


thus not included in this figure. Mean acceleration elicited by ventral 
stimulation was significantly (P < 0.01, Wilcoxon rank-sum test) less 
than that elicited by dorsal axon stimulation. However, acceleration from 
ventral stimulation was significantly greater than chance (P< 0.01 shuffle 
test). This small effect in the ventral striatum could be due to activation 
of fibres, which also project to the dorsal striatum or to an increase in 
arousal. Although note that acceleration frequency during locomotion 
was not altered for stimulation in ventral striatum (see f, h). f, g, Centre 
of mass of acceleration power spectra for each mouse for locomotion 
periods initiated during 3 or 6 Hz stimulations (n = 4 mice). Horizontal 
bars indicate means, lines connect same mouse. h, Mean difference 
between the centre of mass of the acceleration power spectra computed 
for locomotion periods initiated during 3 Hz or 6 Hz axon stimulations in 
ventral (f) or dorsal (g) striatum. Positive values indicate a shift towards 
higher frequency accelerations for 6 Hz stimulations. *P < 0.05, Wilcoxon 
rank-sum test. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


VTA 


» 


Extended Data Figure 7 | Histology and response distributions from 
each sparsely injected mouse. a, GCaMP@6f expression (green) and 
immunofluorescence (red) from all VTA-targeted mice (n = 5). b, Reward 
response versus locomotion index (as in Fig. 4n) for each axon recorded 
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from the corresponding mice in a. Green, significant locomotion; red, 
significant reward; blue both significant; neither significant not shown. 

c, d, Same as a and b, except from all SNc-injected mice (n = 5). Six out of 
ten mice were not stained for tyrosine hydroxylase. Scale bars, 500 jm. 
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Extended Data Figure 8 | Distribution of reward and locomotion indexes 
and fraction of reward and locomotion signalling axons from VTA 

and SNc are highly similar using different correlation thresholds for 
clustering axon segments (a, b); single axon signaling timing during 
locomotion onset versus continuous locomotion (c-e). a, Reward 
response versus locomotion index for putative single axons from SNe (n=5 
mice, top row) and VTA (n=5 mice, bottom row) using different correlation 
thresholds (no clustering, 0.5, and 0.2) for hierarchical clustering of activity 
patterns (see Methods). Axons are colour-coded by significant responses 

to locomotion (green), reward (red), or both (blue). Note that despite the 
total number of putative axons decreasing with correlation threshold, the 
inverse relationship between locomotion and reward signalling across 


the population remains the same. b, Table showing the total numbers and 
fractions of responsive axons across the VTA and SNc populations for 
different clustering thresholds. Note that despite the total number of putative 
axons decreasing with correlation threshold, the fraction of axons signalling 


either reward, locomotion, both or neither is highly similar. c, Correlations 
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(Pearson's) between acceleration and selected putative single SNc axon 

AF/F traces at different relative time-lags (that is, cross-correlations) during 
locomotion initiation periods; each row is mean for each axon for a single 
session (axons from n= 3 out of 5 mice). d, Same as c, but during continuous 
locomotion periods; same axons during same sessions as in c. e, Peak 
cross-correlation times for data shown in c and d (lines connect same axons 
during same sessions). 
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Extended Data Figure 9 | Further characterization of putative single 
dopamine axons in relation to reward and licking. a, Mean AF/F trace 
for VTA reward responsive axons (Methods, n = 23 axons, n= 4 mice 
with variable reward sessions) (top), velocity (mid), and licking (bottom) 
triggered on large volume (red), small volume (blue) and omission (black, 
solenoid click was present, but no reward delivered; Methods, n= 17 
axons, 3 mice with omission sessions) reward deliveries. b, Same as a 
except for SNc locomotion responsive axons (n= 62 and 18 axons for 
reward and omission traces respectively). c, Mean VTA reward axon 


AF/F trace (top) and velocity (bottom) triggered on reward deliveries 
during continuous locomotion (red, m = 25 axons) or rest periods (blue, 
n=37 axons). d, Same as ¢ except for SNc locomotion responsive axons 
(n= 25 and 62 axons for locomotion and rest respectively). e, Mean 

VTA reward axon AF/F (top) and mouse licking (bottom) triggered on 
spontaneous, non-reward licking onsets (n = 15 axons, 3 mice). f, Same as 
e except for SNc locomotion responsive axons (n = 15 axons, 3 mice). Mice 
that did not lick outside reward periods were excluded. Shaded regions in 
a-f denote mean +s.e.m. 
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Extended Data Figure 10 | See next page for caption. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


Extended Data Figure 10 | Dopamine axon locomotion signalling 
measured by fibre photometry from different striatal sub-regions. 

a, Top, comparison of mean photometry fluorescence AF/F (mean of 
significant transients, excluding baseline periods) recorded from dorsal 
striatum between locomotion and resting periods; each point represents 
mean AF/F for running or resting over one session for recording from a 
single dorsal striatum location (lines connect same recording location/ 
session; n =5 mice). Bottom, comparison of mean baseline (periods with 
no significant calcium transients) photometry AF/F recorded from dorsal 
striatum between locomotion and resting periods; each point represents 
mean baseline AF/F for running or resting over one session for recording 
from a single dorsal striatum location (lines connect same recording 
location/session, n=5 mice). b, c, Same as a, except for recordings from 


central and ventral striatum, respectively. d, Mean photometry AF/F 
recorded from dorsal striatum triggered on locomotion initiations (mean 
across all initiations, n =5 mice). e, f, Same as d, except for recordings 
from central and ventral striatum, respectively. g, Locomotion index 

(top) and reward response (bottom) versus striatum recording depth 
(from data presented in Fig. 5). h, Schematic of prominent current 

model for dopamine signalling dynamics in the striatum. i, Schematic 

of our new model for dopamine signalling dynamics based on data 
presented here. j, k, Saggital schematics illustrating current homogenous 
dopamine signalling model (j) and our new model incorporating 
functional heterogeneity (k). **P < 0.01, Wilcoxon rank-sum test; n.s., not 
significant. Shaded regions in d-f, mean + s.e.m across initiations (n = 20, 
28 and 56 from dorsal, central and ventral respectively in 5 mice). 
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Human commensals producing a novel 
antibiotic impair pathogen colonization 


Alexander Zipperer!?*, Martin C. Konnerth**, Claudia Laux!?, Anne Berscheid‘, Daniela Janek!7+, Christopher Weidenmaier?”, 
Marc Burian®, Nadine A. Schilling*:’, Christoph Slavetinsky!?, Matthias Marschal°, Matthias Willmann*°, Hubert Kalbacher’, 
Birgit Schittek®, Heike Brétz-Oesterhelt?*, Stephanie Grond*, Andreas Peschel? & Bernhard Krismer!* 


The vast majority of systemic bacterial infections are caused by facultative, often antibiotic-resistant, pathogens 
colonizing human body surfaces. Nasal carriage of Staphylococcus aureus predisposes to invasive infection, but the 
mechanisms that permit or interfere with pathogen colonization are largely unknown. Whereas soil microbes are known 
to compete by production of antibiotics, such processes have rarely been reported for human microbiota. We show that 
nasal Staphylococcus lugdunensis strains produce lugdunin, a novel thiazolidine-containing cyclic peptide antibiotic 
that prohibits colonization by S. aureus, and a rare example of a non-ribosomally synthesized bioactive compound from 
human-associated bacteria. Lugdunin is bactericidal against major pathogens, effective in animal models, and not prone 
to causing development of resistance in S. aureus. Notably, human nasal colonization by S. lugdunensis was associated 
with a significantly reduced S. aureus carriage rate, suggesting that lugdunin or lugdunin-producing commensal bacteria 
could be valuable for preventing staphylococcal infections. Moreover, human microbiota should be considered as a source 


for new antibiotics. 


Infections caused by highly antibiotic-resistant bacteria have greatly 
increased in recent years and represent a major cause of morbidity 
and mortality worldwide, including in developed countries!”. Multi- 
drug resistant organisms (MDRO), such as methicillin-resistant 
S. aureus’, vancomycin-resistant enterococci*, and third-generation 
cephalosporin-resistant Gram-negative bacteria”, are expected to 
become more frequent causes of death than cancer in the coming 
decades®. Despite the urgent need for new antibiotics that are effec- 
tive against resistant bacteria, very few compounds are in develop- 
ment, the majority of which are congeners of currently used antibiotic 
classes”®. Nevertheless, innovative approaches for cultivation of yet 
uncultured potential antibiotic producers or activation of silent bio- 
synthetic gene clusters have recently led to the discovery of entirely 
new antimicrobial compounds with useful properties” '*. Moreover, 
new compound sources, such as the large inventory of antimicrobial 
host defence peptides from higher organisms, open new avenues for 
the development of new anti-infectives!**. As the available anti- 
biotics lose their efficacy, it is important to limit the ongoing spread 
of resistant bacteria’®. Unfortunately, most of the major MDROs are 
increasingly disseminating in the community and cannot be effec- 
tively targeted by standard infection-control measures”. The strong 
antibiotic selection pressure in humans and in farm animals and the 
increasing fitness of methicillin-resistant S. aureus, vancomycin- 
resistant enterococci, and third-generation cephalosporin-resistant 
Gram-negative bacteria in competition with commensals leads to 
the symptomless and usually unrecognized presence of MDRO in 
the microbiota of many humans'*'®. Notably, the vast majority of 
systemic bacterial infections are caused by endogenous pathogens 
from human microbiota, and MDRO-colonized individuals are 
exposed to a substantially higher risk of invasive infections, that are 
difficult to treat, when they undergo surgery or immunosuppression, 


or suffer from trauma!!®”°. S. aureus is found in the anterior nares 
of approximately 30% of the human population. Eradication by the 
antibiotic mupirocin strongly reduces predisposition to invasive 
infection’. However, mupirocin has to be applied repeatedly over 
five days, requiring strict compliance and costly extension of patient 
hospitalization; in addition, rates of mupirocin resistance are steadily 
increasing”!. Moreover, current decolonization strategies targeting 
MDRO at other body surfaces, such as the intestine, depend on 
broad-spectrum antibiotics and are highly controversial”*. Thus, there 
is not only an urgent need for new therapeutic antibiotics but also for 
new effective MDRO decolonization strategies. 

Although many ongoing research efforts address major virulence 
and resistance mechanisms, the processes governing bacterial 
fitness, competition, and dissemination in microbiota remain poorly 
understood. Recent studies have shown how diverse and dynamic 
human microbiota are, especially those of the environment-exposed 
microbiota of the skin”? and upper airways”. These ecological niches 
are particularly poor in nutrients”, suggesting that colonizing bacteria 
are probably in strong competition and may use a variety of strategies to 
overcome competitors*>”®, Bacteria from human microbiota have occa- 
sionally been found to produce bacteriocins, antimicrobial substances 
acting against closely related bacteria””**. Genes related to antibiotic 
biosynthesis have been identified in human metagenomes” leading 
to the discovery of lactocillin, a ribosomally synthesized thiopeptide 
antibiotic produced by a human vaginal commensal*°. However, the 
potential roles of such complex bioactive compounds in shaping human 
microbiota are still unknown. 


Antimicrobial activity of a nasal commensal 
A previously described collection of nasal Staphylococcus isolates” was 
screened for antimicrobial activity against S. aureus. Whereas most 
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Figure 1 | Lugdunin production by wild-type S. lugdunensis and 
isogenic mutants. a, Bioactivity assay with the S. lugdunensis IVK28 wild 
type, the AlugD mutant and the complemented mutant against S. aureus 
USA300. Basic medium plates were inoculated with a lawn of S. aureus 
USA300. S. lugdunensis IVK28 cells from overnight cultures of wild 

type, the lugdunin-deficient deletion mutant AlugD and complemented 
mutant AlugD::pRB474-lugD were spotted onto the plates. b, HPLC-UV- 
chromatogram shows comparison of S. lugdunensis 1VK28 wild type and 
the lugdunin-deficient transposon mutant M1. Cell extracts of the IVK28 
wild type (blue) and the transposon mutant M1 (red) were compared by 
reversed-phase HPLC-UV. The inlay depicts the absorption spectrum 

of lugdunin with the prominent absorption at 280 nm, indicating the 
presence of tryptophan. Absorbance was measured in milli absorption 
units (mAU). 


isolates did not affect S. aureus, the strain S. lugdunensis IVK28 was 
found to have a particularly strong capacity to prevent the growth of 
S. aureus (Fig. 1a). 1VK28 produced an antibacterial substance only 
under iron-limiting conditions and only on solid agar surfaces, not in 
liquid culture. 

Transposon mutagenesis of 1VK28 led to isolation of mutant M1, 
which did not inhibit S. aureus. Analysis of the transposon inser- 
tion site revealed the disruption of an uncharacterized gene encod- 
ing a putative non-ribosomal peptide synthetase (NRPS; position 
860375/76 of SLUG_RS03940 in the annotated genome sequence of 
S. lugdunensis N920143 ref. 31; accession number, NC_017353.1). This 
gene is encoded in an operon of approximately 30 kbp with several 
other NRPS and antibiotic biosynthesis-related genes (Fig. 2a and 
Extended Data Fig. 1a), suggesting that the inhibitory molecule of 
IVK28 may be a complex, non-ribosomally synthesized peptide com- 
pound. The NRPS operon was found in all S. lugdunensis genomes in 
the databases, and it was identified by PCR in each of the S. lugdunensis 
strains from our strain collection, indicating that it is characteristic 
of the species, rather than a strain-specific feature. Nevertheless, the 
GC-content of the operon of 26.9% is clearly distinct from that of the 
whole genome (33.8%) indicating that the gene cluster may have been 
transferred to S. lugdunensis by horizontal gene transfer from another 
bacterial species. The operon consists of four NRPS genes (named lugA, 
B, C, D) encoding adenylation domains for five amino acids (Fig. 2b). 
The Jug operon further includes all genes whose products are required 
for the synthesis and transport of a non-ribosomally synthesized pep- 
tide compound (Extended Data Fig. 1a). Gene clusters for secondary 
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metabolite production are frequently found in streptomycetes and 
other soil bacteria but are rare in human-associated bacteria”. The lug 
operon was exclusively found in S. Jugdunensis and encodes a unique 
combination of antibiotic biosynthesis enzymes, all with less than 
35% identity to any other described enzyme, suggesting that it may 
be responsible for biosynthesis of a novel compound. To confirm that 
the Jug operon is responsible for the antimicrobial activity of 1VK28, 
the smallest NRPS gene, lugD, was deleted by gene replacement. The 
mutant AlugD showed no detectable antimicrobial activity, but the 
phenotype was restored by complementation with a plasmid-encoded 
copy of lugD (Fig. 1a). 


Lugdunin, a novel peptide antibiotic 

The antimicrobial activity of [VK28 was enriched by ethanol extraction 
of agar-grown cells. Reversed-phase high-pressure liquid chromatog- 
raphy (HPLC) coupled with ultraviolet (UV) and mass spectrometry 
revealed obvious differences between extracts from IVK28 wild-type 
and mutant M1 in only one signal (Fig. 1b). The molecular formula, 
C4pHe2NgO¢8S, deduced from electrospray ionization high-resolution 
mass spectrometry (Extended Data Fig. 2a, b; [M+H]* CagHe3NgO¢S*, 
calculated 783.45858, found 783.45850, A0.1 parts per million (p.p.m.), 
relative molecular mass = 783.03) did not correspond to any known 
molecule. Spectral UV absorbance at 280 nm indicated the presence 
of a tryptophan moiety (Fig. 1b). As the compound was not produced 
in liquid culture, which impeded isolation of sufficient amounts for 
chemical characterization and biological profiling, expression of the 
biosynthetic genes was uncoupled from the native Jug operon promoter 
and its regulation. This strategy has recently been successfully applied 
to other antimicrobial gene clusters’”**°. The putative tetR-like regula- 
tor gene of the Jug operon (JugR) was deleted, and the strong xylAB pro- 
moter was inserted upstream of the NRPS genes, along with the gene 
for its corresponding xylose-sensitive repressor XylR. Cultivation of the 
resulting strain S. lugdunensis IV K28-Xyl (Extended Data Fig. 1b) in 
liquid culture with xylose led to strong production of inhibitory activity, 
which could be purified by consecutive steps of 1-butanol extraction, 
gel filtration, and reversed-phase HPLC. The pure compound exhibited 
exactly the same mass as determined for the compound from the wild- 
type strain IVK28 and was named lugdunin. 

Nuclear-magnetic resonance (NMR), electrospray ionization 
high-resolution mass spectrometry, and an advanced Marfey’s anal- 
ysis of lugdunin revealed a cyclic peptide (Fig. 2c, Extended Data 
Fig. 3a—-e, and Extended Data Table 1) comprising an unusual thiazo- 
lidine heterocycle and five amino acids (D-valine, L-tryptophan, 
p-leucine, L-valine, and p-valine) (Fig. 2c). The thiazolidine building 
block occurs in certain linear NRPS products, such as watasemycins™ 
and yersiniabactin*®, but is yet unreported in macrocyclic peptides 
(Extended Data Fig. 4). The lugdunin thiazolidine ring is probably 
formed by condensation of an N-terminal L-cysteine with a C-terminal 
L-valine residue (position one and seven, respectively, of the precursor 
peptide) upon reductive release of a linear heptapeptide aldehyde from 
the NRPS mega-enzyme by the terminal reductase of LugC (Fig. 2b and 
Extended Data Fig. 4d). Total chemical synthesis of lugdunin yielded a 
product with identical chemical properties and antimicrobial activity 
compared to the natural product from strain IVK28, which confirmed 
the assigned lugdunin structure. 

The origin of lugdunin from a heptapeptide was unexpected because 
the NRPS proteins from the Jug operon encompass only five adenyla- 
tion domains, central enzymatic domains whose number usually deter- 
mines the number of incorporated amino acids (Fig. 2b). The predicted 
specificities of NRPS adenylation domains*® were in agreement with the 
identified amino acids, except for position three (tryptophan instead 
of threonine), indicating a new specificity for the second adenylation 
domain of LugA. Moreover, LugC exhibited an unusual modular 
organization®” with one adenylation domain (valine) but two 
downstream peptide-bond-forming condensation and three pep- 
tidyl carrier protein domains required for amino acid transfer. This 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


lugC 


g 
§ 
a as 
WH He 


Thiazolidine 


Figure 2 | Gene cluster, proposed biosynthesis pathway and chemical 
structure of lugdunin. a, NRPS genes /ugA, B, C, D. b, Modular 
organization of gene products. Functional domains: A, adenylation; 

P, peptidyl carrier protein; C, condensation; E, epimerization; 

R, reductase. The sequential biosynthesis of lugdunin starts presumably at 
the characteristic initiation module of LugD and continues with LugA-C. 


organization suggested that the single adenylation domain of LugC 
is responsible for activation of three consecutive valine units, which 
are subsequently incorporated in alternating L- and p-configurations. 
A partially similar mechanism has been described for yersiniabactin 
biosynthesis** (see also Supplementary Discussion). Thus, the LugC 
domains represent an unusual assembly line compared to characterized 
NRPS machineries. 


Activity against major human pathogens 

Lugdunin has potent antimicrobial activity against a wide range of 
Gram-positive bacteria, including opportunistic pathogens such 
as difficult-to-treat methicillin-resistant S. aureus, glycopeptide- 
intermediate resistant S. aureus and vancomycin-resistant 
Enterococcus isolates. Minimal inhibitory concentration (MIC) 
values in the micromolar range (1.5-12 pg ml~!; 1.9-15.3 pM) 
demonstrated high potency, and this activity was not impaired in 
the presence of human serum (Table 1). Lugdunin was bactericidal 
against methicillin-resistant S. aureus with complete killing at 10 x MIC 
(Fig. 3a). Lugdunin did not cause lysis of primary human neutro- 
phils or erythrocytes, and even high amounts of lugdunin showed no 
substantial inhibition of the metabolic activity of the human monocytic 
cell line HL60 (ICs > 50 1g ml!) (Extended Data Fig. 5). Bacterial 
cells exposed to lugdunin stopped incorporating radioactive DNA, 
RNA, protein or cell-wall precursors almost simultaneously even 
at concentrations below the MIC (Extended Data Fig. 6), suggesting 
that lugdunin may lead to rapid breakdown of bacterial energy resources. 
In this respect, lugdunin resembles daptomycin, which has been 
shown to cause parallel cessation of all four metabolic pathways*? 
and whose exact mode of action is still unknown. During lugdunin 
purification, a 2-p-allo-isoleucine variant (instead of 2-p-valine) 
was isolated. This derivative with an additional methyl group and 
stereogenic centre was produced by S. lugdunensis in small amounts 
and was inactive. Development of resistance was not observed in 
S. aureus during continuous serial passaging in the presence of 
subinhibitory concentrations of lugdunin over 30 days (Fig. 3b). In 
contrast, S. aureus rapidly developed resistance to rifampicin within a 
few days of exposure (Fig. 3b). 


The C-terminal reductase domain of LugC catalyses aldehyde formation 
at the thioester of the terminal valine and enables subsequent ring closure 
of the thiazolidine, leading to the cyclopeptide. c, Chemical structure of 
lugdunin. Cyclization of the reactive C-terminal aldehyde and N-terminal 
cysteine is assumed to occur via thiohemiacetal to the thiazolidine 
building block (Extended Data Fig. 4d). 


The capacity of lugdunin to cure infections in vivo was analysed 
in a mouse skin infection model reflecting a typical human S. aureus 
infection and a common indication for antibiotic treatment??™. 
The back skin of shaved black-6 mice (C57BL/6) was superficially 
damaged by multiple stripping with adhesive tape and was infected with 
S. aureus. Then, 24, 30, and 42h after infection, mice were treated 
with 1.5 .g lugdunin per time point and infection site, and 6h later 
mice were euthanized. Lugdunin treatment led to a strong reduction 
or even complete eradication of viable S. aureus on the surface and in 
the deeper layers of the skin (Figs 3c, d), demonstrating that lugdunin 
eradicates S. aureus and penetrates tissues in vivo. Note that infection 
was not affected by lugdunin in two samples, most probably because 
some animals removed the lugdunin ointment by intensive licking. 
Bacteria obtained from these skin samples exhibited unchanged 
susceptibility to lugdunin, indicating that they had not developed 
resistance. 


Table 1 | Lugdunin spectrum of activity 


Lugdunin MIC 
Species and strain Resistance (wgml~!) 
Staphylococcus aureus USA300 (LAC) MRSA 1.5 
+ 50% human serum 15 
Staphylococcus aureus USA300 MRSA 1.5 
(NRS384) 
Staphylococcus aureus Mu50 GISA 3 
Staphylococcus aureus SA113 3 
Staphylococcus aureus RN4220 3 
Enterococcus faecium BK463 VRE 3 
Enterococcus faecalis VRE366 VRE 12 
Listeria monocytogenes ATCC19118 6 
Streptococcus pneumoniae 15 
ATCC49619 
Bacillus subtilis 168 (trpC2) 4 
Pseudomonas aeruginosa PAO1 >50 
Escherichia coli DH5a >50 


MRSA, methicillin-resistant S. aureus; GISA, glycopeptide intermediate-resistant S. aureus; 
VRE, vancomycin-resistant Enterococcus. 
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Figure 3 | Lugdunin has bactericidal activity, a low risk for resistance 
and efficacy in a mouse skin infection model. a, Killing curve. 
Incubation of S. aureus with a 10 x MIC of lugdunin leads to complete 
killing of the inoculum after 30h (detection limit is 10° colony-forming 
units (c.f.u.) per ml). Data represent medians + s.d. of three independent 
experiments. b, Serial passaging of S. aureus with sub-inhibitory 
concentrations of rifampicin leads to high resistance against rifampicin. 
However, development of resistance is not observed with lugdunin. 

A representative of two independent experiments is shown. c, d, Mouse 
skin infection. Lugdunin treatment (3 times, 1.5 jg per skin area) of 

S. aureus skin infections (5 mice corresponds to 10 skin punches per 
group) leads to strongly reduced numbers of viable bacteria after two days. 
c.f.u. for surface-attached bacteria (c) or bacteria located in the deeper skin 
tissue (d) per skin punch are shown. Horizontal lines represent the median 
of each group. Significant differences between groups were analysed by the 
Mann-Whitney test (*P < 0.05). 


Lugdunin production outcompetes S. aureus 

The production of antimicrobials, mostly plasmid-encoded riboso- 
mally synthesized bacteriocins, has been sporadically documented in 
individual bacterial strains from human microbiota’. However, the 
roles of such compounds in microbial fitness and in microbiota dynam- 
ics have remained largely unknown. To determine whether lugdunin 
contributes to the capacity of S. lugdunensis IVK28 to prevail in com- 
petition with S. aureus, the two species were co-cultivated on solid agar 
surface, promoting lugdunin production, and bacterial numbers were 
monitored for three days. 

As shown in Fig. 4a, the lugdunin-producing IVK28 wild type over- 
grew S. aureus efficiently, even when the inoculum contained tenfold 
more S. aureus than S. lugdunensis cells. No viable S. aureus cells were 
recovered after three days, indicating complete killing by S. lugdunensis. 
In contrast, IVK28 AlugD could not outcompete S. aureus (Fig. 4b) 
and was even overgrown when inoculated at tenfold higher numbers 
than S. aureus (Fig. 4d). The S. aureus-eradicating capacity of AlugD 
could be largely restored by complementation with /ugD on a plasmid 
(Fig. 4c). These data demonstrate that S. lugdunensis 1VK28 wild type 
can effectively eradicate S. aureus and that lugdunin production is 
responsible for this trait. 

Nasal carriage is known to be a major risk factor for invasive 
S. aureus infections'*?. To explore whether S. Iugdunensis can inter- 
fere with nasal S. aureus colonization in vivo in vertebrates, the noses 
of cotton rats, a well-established animal model for investigating 
S. aureus nasal colonization*?, were instilled with mixtures of 
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Figure 4 | Lugdunin-producing wild-type S. lugdunensis 1VK28 
restricts the growth of S. aureus in vitro and in vivo in cotton rats. 
a, S. aureus is overgrown by wild-type S. lugdunensis 1VK28 (dark or light 
grey columns, respectively) on agar plates inoculated at ratios of ~90:10. 
b, In contrast, the ratio between IVK28 AlugD and S. aureus, inoculated 
at a ratio of ~90:10, does not change significantly over time. c, The 
capacity of S. lugdunensis AlugD to overgrow S. aureus is largely restored 
by plasmid-encoded lugD. d, S. aureus displaces IVK28 AlugD even 
when inoculated at ratios of ~10:90. Data represent mean values + s.d. of 
three independent experiments. Significant differences between starting 
conditions and indicated time points were analysed by one-way ANOVA 
(*P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001; NS, not significant). 
e, Cotton rat noses (8 animals per group) co-colonized by S. aureus and 
S. lugdunensis 1VK28 wild type show significantly less S. aureus c.f.u. 
after five days compared to S. lugdunensis IVK28 AlugD. Horizontal lines 
represent the median of each group. Significant differences, calculated by 
the Mann-Whitney test, are indicated (*P < 0.05). 


S. lugdunensis 1VK28 wild type or AlugD plus S. aureus. The three 
test strains colonized cotton rat noses stably over the 5-day test period 
when instilled individually (Extended Data Fig. 7). However, when the 
two species were co-inoculated, significantly less S. aureus cells were 
retrieved from animals co-colonized by IVK28 wild type compared 
to those co-colonized with AlugD (Fig. 4e). This finding indicates 
that lugdunin production can effectively interfere with S. aureus 
colonization in vivo. 
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Table 2 | S. aureus and S. Jugdunensis distribution in hospitalized 
patients 


S. lugdunensis- S. lugdunensis- 


positive negative Total 
S. aureus-positive 1 59 60 
S. aureus-negative 16 Til 127 
Total 17 170 187 
Risk8 0.059* 0.347% 0.321 


Significant differences between S. /ugdunensis-positive and S. /ugdunensis-negative patients were 
analysed by the Chi-squared test. 

§0.059 versus 0.347: risk ratio, 0.169; 95% confidence interval 0.025-1.147. 

*P=0.015. 


Interference with S. aureus carriage 

Factors governing human nasal S. aureus carrier status have remained 
largely unknown**“*. To investigate whether the presence of 
S. lugdunensis in the human nose can prevent co-colonization by 
S. aureus, we examined nasal swabs from 187 hospitalized patients for 
colonization by S. lugdunensis, S. aureus or both. The overall colonization 
rates of S. aureus and S. lugdunensis were 32.1% and 9.1%, respectively 
(Table 2), which corresponds to published data?°*>, Of note, the 
percentage of S. aureus colonization in S. lugdunensis-positive indi- 
viduals (5.9%) was approximately 5.9-fold lower than in S. lugdunensis- 
negative individuals (34.7%). Chi-squared statistical analysis showed 
that the reduced S. aureus detection rate in the presence of S. lugdunensis 
for the entire study population was significant (risk ratio 0.169; 95% 
confidence interval 0.025-1.147; P=0.015; Table 2) indicating that 
strong interference precludes the simultaneous presence of S. aureus 
and S. lugdunensis in the human nose. Consistent with this finding, 
PCR analysis demonstrated that all nasal S. lugdunensis isolates con- 
tained the /ug operon and all of the tested S. aureus strains (30 nasal as 
well as 7 laboratory) were found to be highly susceptible to lugdunin. 
Together with the ability of the lugdunin producer IVK28 to reduce 
S. aureus numbers in cotton rat noses, these data provide strong evi- 
dence for the potent capacity of S. lugdunensis to prevent human nasal 
colonization by S. aureus. 


Conclusions 

The human predisposition to S. aureus nasal carriage is governed by 
several host genetic or microbiota-related factors, which may affect the 
capacity of S. aureus to adhere to and multiply on nasal epithelia***“°. 
Here we provide evidence for a crucial role of S. lugdunensis and its 
antimicrobial product, lugdunin, in preventing S. aureus colonization 
of the human nose. S. aureus can only rarely be isolated from the axillae 
or groin, also habitats of S. lugdunensis in addition to nares*, which 
may therefore be related to the production of lugdunin at these sites. 
Accordingly, utilizing lugdunin or lugdunin-producing commensal 
staphylococci could become a valuable strategy for preventing S. aureus 
colonization and invasive infections in high-risk patients, such as those 
undergoing elective surgery, immunosuppression, or regular haemo- 
dialysis. The concept of probiotic bacteria interfering with predispo- 
sition to infection has been pursued mostly for enteric pathogens”. 
Our study suggests that the probiotics concept should be extended to 
body sites other than the gut, such as the nasal mucous membranes, 
and to additional major pathogens such as S. aureus. S. lugdunensis 
is known as a rare cause of opportunistic infections*®, but a mutant 
lacking all potential virulence factors or introduction of the lug genes 
into an exclusively commensal species might enable the development 
of a safe probiotic strain. 

Lugdunin represents the first example, to our knowledge, of a new 
class of compounds that we suggest should be named macrocyclic 
thiazolidine peptide antibiotics. Lugdunin shows bactericidal activ- 
ity against many major pathogens combined with a particularly high 
barrier to resistance by mutation, even under prolonged selective 
pressure. The fact that all tested nasal and clinical S. aureus isolates 
maintained pronounced susceptibility to lugdunin suggests that it may 
be particularly difficult for bacteria to develop resistance to lugdunin. 
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This indicates that lugdunin has apparently evolved for the purpose of 
bacterial elimination in the human organism, implying that it is opti- 
mized for efficacy and tolerance at its physiological site of action. Thus, 
lugdunin shows promise as a potential drug for inhibiting growth of 
S. aureus in the nares and potentially other body sites. 

The genetic inventory of human-associated microbiomes is of great 
importance for many critical human body functions for example, deg- 
radation of ingested polymers, detoxification of xenobiotics, and release 
of immunomodulatory molecules’. Our discovery of commensal bac- 
teria producing a potent antibiotic is in accord with recent reports on 
the presence of gene clusters for complex secondary metabolites in 
human-associated metagenomes*” and suggests that many humans are 
constantly exposed to potent bioactive compounds from microbiota. 
The effect of such compounds on human body functions may vary sub- 
stantially with the dynamic changes in microbiota structure. Whereas 
it has become increasingly difficult to identify novel compound struc- 
tures from soil microbes such as actinomycetes, bacteria from human 
microbiota may become a valuable source for new types of antibiotics. 
It will be a challenge for future research to elucidate the identity, vari- 
ability, activity, and ecological roles of such compounds and to exploit 
them for the development of new drugs. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. The animal experiments were not randomized and the investiga- 
tors were not blinded to allocation during experiments and outcome assessment. 
No statistical methods were used to predetermine sample size. For the human 
colonization study, only anonymous and randomized samples were obtained. 
Strains and growth conditions. The Staphylococcus strains used in this study 
were S. aureus USA300 LAC, S. aureus USA300 NRS384, S. aureus Mu50, 
S. aureus RN4220, S. aureus SA113, S. aureus Newman, S. aureus PS187, 
S. lugdunensis IVK28, S. lugdunensis 1VK28 AlugD, and S. lugdunensis IVK28- 
Xyl. Further strains used for MIC determination were Enterococcus faecium 
BK463, E. faecalis VRE366, Listeria monocytogenes ATCC19118, Streptococcus 
pneumoniae ATCC49619, Pseudomonas aeruginosa PAOI, and Escherichia coli 
DH5«. E. coli DC10B was used as the cloning host and B. subtilis 168 (trpC2) was 
used for precursor incorporation studies. In addition, a set of 60 S. aureus and 
17 S. lugdunensis strains were isolated from diagnostic samples in the course of 
the colonization study described below. 

Basic medium (BM: 1% soy peptone, 0.5% yeast extract, 0.5% NaCl, 0.1% glu- 
cose and 0.1% K,HPOu, pH 7.2) was used as the standard growth medium. MIC 
determinations and killing assays were performed in Mueller Hinton Broth (MHB; 
Roth, Karlsruhe, Germany). For the identification of S. lugdunensis, selective 
S. lugdunensis medium (SSL) was used as previously described®’. When necessary, 
antibiotics were used at concentrations of 250 1gml for streptomycin, 10j.gml! 
for chloramphenicol, 2.5.gml! for erythromycin, and 100j1g ml“! for ampicillin. 
Bioactivity test. The antimicrobial activity against S. aureus of S. lugdunensis 
IVK28 was identified by screening 90 nasal staphylococcal isolates for the capac- 
ity to inhibit growth of S. aureus. For this purpose, BM agar was inoculated 
1:10,000 with an overnight culture of S. aureus USA300 LAC. The test strains 
were inoculated on the resulting bacterial lawn, and the plates were incubated 
for 24-48 h at 37°C. To investigate the production of antimicrobial activity by 
IVK28 under iron-limiting conditions, BM agar was supplemented with 200 1M 
2, 2’-bipyridine”. 

Transposon mutagenesis and elucidation of the lugdunin gene cluster. The 
temperature-sensitive plasmid pT V 1ts, which contains the 5.3-kb transposon 
Tn917 (erm®) from E. faecalis, was transferred into S. lugdunensis IVK28 by electro- 
poration. Transposon mutants were screened for loss of antimicrobial activity 
against S. aureus. Chromosomal DNA was isolated by standard procedures from 
non-inhibitory clones, and the primers Tn917 up and Tn917 down (Extended 
Table 2) were used to directly sequence the flanking regions of the transposon 
insertion site. Sequence analysis was performed with DNASTAR Lasergene soft- 
ware (DNASTAR Inc., Madison, WI, USA). Bioinformatic analysis was performed 
by BLAST (http://blast.ncbi.nlm.nih.gov/Blast.cgi) and antiSMASH 3.0 (ref. 36). 

Generation of S. lugdunensis IVK28-Xyl. The flanking regions of lugR were 
amplified by PCR with the primer pairs SIPr1-up/SIPr1—down and SIPr2-up/ 
SIPr2-down (Extended Data Table 2). The plasmid pBASE6-erm/lox 1, a deriv- 
ative of pBASE6 (ref. 51), already containing an erythromycin resistance cassette 
in the singular Smal site, was linearized with Acc65I. The identically digested 
SIPr1 PCR product, containing one natural Acc65I restriction site and one intro- 
duced by the primer, was ligated into pBASE6-erm/lox1. The resulting vector 
with the correctly oriented SIPr1 PCR product and the SIPr2 PCR product were 
ligated after digestion with EcoRV and BglII. The resulting pBASE6-erm/lox 1 
construct with both flanking regions inserted was linearized with BssHIL treated 
with Klenow enzyme and digested with BglII. The required xylR fragment with 
the downstream-located xylAB-promoter was excised from pT X15 by HindIII 
restriction, treated with Klenow enzyme and subsequently digested with BamHI. 
The ligation of the xyJR fragment into the appropriate vector generated pBASE6- 
erm/lox1-xylR, which was transferred into E.coli DC10B and subsequently into 
S. aureus PS187. The resulting plasmid pBASE6-erm/lox1-xylR was transduced 
into S. lugdunensis IVK28 via the bacteriophage ©187 according to ref. 52. 
Homologous recombination for replacement of lugR by erm/xylR was performed 
as previously described*!, generating the xylose-inducible lugdunin producer 
strain S. lugdunensis [VK28-Xyl. 

Production and purification of lugdunin. A fresh overnight culture of 
S. lugdunensis I1VK28-Xy] was inoculated 1:1,000 in BM without glucose and was 
supplemented with 0.5% xylose. After incubation at 37 °C under continuous shak- 
ing (160 rpm) for 24h, whole cultures were extracted with 1-butanol at a ratio of 
5:1. The aqueous phase was discarded, and the organic phase was evaporated at 
37°C under reduced pressure and finally dissolved in methanol. The methanol 
extract was applied to a gel filtration column (Sephadex LH20, 1.6 x 80cm, flow 
rate 1 ml min“! methanol). The active fractions containing lugdunin were pooled, 
evaporated at 37°C under reduced pressure and dissolved in dimethyl sulfoxide 
(DMSO). This solution was then subjected to a preparative reversed-phase HPLC 
column (Kromasil C18, 7 jum, 250 x 20 mm; Dr. Maisch, Ammerbuch, Germany) 
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with an isocratic elution at 79% methanol in water for 20 min. The fractions 
containing lugdunin were baseline-separated from the remaining compounds. 
Methanol was evaporated at 37°C under reduced pressure to yield lugdunin as 
a white powder. 

Chemical synthesis of lugdunin. Total chemical synthesis was achieved by a 
Fmoc (9-fluorenylmethoxycarbonyl) strategy-based manual solid-phase peptide 
synthesis and was established on an H-Val-H NovaSyn TG resin (Novabiochem, 
Switzerland). Amino acids were coupled in a fourfold excess using HATU 
(1-[bis(dimethylamino) methylene]-1H-1,2,3-triazolo[4,5-b] pyridinium 3-oxid 
hexafluorophosphate). Valine positions were coupled twice by PyOxim ([ethyl 
cyano(hydroxyimino)acetato-O”tri-1-pyrrolidinylphosphonium hexafluorophos- 
phate) for the second coupling instead of HATU. Deprotection was performed in 
trifluoroacetic acid for 30 min. Peptides were cleaved from the resin with acetoni- 
trile:water:trifluoroacetic acid (79.95:20:0.05) for 30 min. Lyophilisation yielded the 
crude product. Crude synthetic lugdunin product was purified by reversed-phase 
HPLC and compared with the natural product by electrospray ionization liquid 
chromatography high-resolution mass spectrometry, additional chiral-HPLC 
methods (column, Dr. Maisch Reprosil Chiral NR, Ammerbuch, Germany; elu- 
tion with 80% premixed methanol in H,O at 1.5 ml min! flow rate), bioactivity 
assay and advanced Marfey’s analysis. 

MIC assay, serum stability and spectrum of activity. S. aureus RN4220, S. aureus 
USA300 (LAC), S. aureus USA300 (NRS384), S. aureus SA113, S. aureus Mu50, 
E. coli DH5a and P. aeruginosa PAO] were grown overnight in MHB. E. faecalis 
VRE366, E. faecium BK463, S. pneumoniae, and L. monocytogenes were grown 
in tryptic soy broth (Difco Laboratories, Augsburg, Germany). Serum stability 
of lugdunin was determined in 50% MHB with 50% human serum, which was 
obtained by standard Histopaque/Ficoll centrifugation of freshly isolated blood. 
All strains were incubated at 37 °C under continuous shaking. Early log-phase 
grown bacteria were adjusted in MHB to 1 x 10° c.f.u. per ml in microtiter plates, 
mixed with varying concentrations of the antibiotic and incubated at 37°C for 24h 
under continuous shaking. The OD¢oo of each well was measured with a microplate 
reader, and the lowest peptide concentrations, which displayed no bacterial growth, 
were defined as the MIC. The assays were performed in 96-well microtiter plates. 
MIC values for B. subtilis 168 to be used as points of reference in precursor incor- 
poration studies were determined in Belitzky minimal medium, using a final 
inoculum of 5 x 10° c.f.u. per ml. 

Killing assay. Fresh MHB was inoculated 1:10,000 with an overnight culture of 
S. aureus USA300 LAC and was incubated at 37°C under continuous shaking 
(160 rpm) until bacteria were grown to 1 x 10° c.f.u. per ml. Then, 10 x MIC lug- 
dunin was added. At the time points 0h, 2h, 4h, 8h, 24h and 30h, samples were 
taken and centrifuged. The pellet was resuspended in 1 x PBS and serially diluted. 
The dilutions were spotted on tryptic soy agar, and colony counts were determined 
after overnight incubation at 37 °C. To determine cell numbers <10? c.f.u. per ml, 
whole cultures were centrifuged and plated on tryptic soy agar. 

Precursor incorporation studies. B. subtilis 168 (trpC2) is a widely used model 
organism for mode of action investigations and was also used for orienting stud- 
ies on the mechanism of lugdunin®!°°. To measure incorporation of radioactive 
precursors into acid-precipitable macromolecules, B. subtilis 168 (trpC2), expo- 
nentially growing in Belitzky minimal medium (ODg0oo of 0.04), was labelled 
with 0.02 MBq mI"! of [methyl-*H]thymidine (DNA synthesis), [5,6-*H] uridine 
(RNA synthesis), [4,5-°H]z-leucine (protein synthesis) or [1-°H] glucosamine 
p-hydrochloride (cell-wall synthesis). After 5 min culture aliquots were treated 
with lugdunin at concentrations of 2, 1, 0.67 or 0.5,g ml, corresponding to 1/2x, 
1/4x, 1/6x or 1/8 the MIC, respectively. Ciprofloxacin (2 1g ml!, 4 x MIC), 
rifampicin (11g ml~!, 4 x MIC), chloramphenicol (161g ml~!, 4 x MIC) or 
vancomycin (21g ml~!, 4 x MIC) served as reference antibiotics for inhibition of 
DNA, RNA, protein or cell-wall biosynthesis, respectively. Samples (0.1 ml) were 
taken in regular time intervals and precipitated with 6% perchloric acid in a multi- 
screen filter plate (Millipore, 0.45 1m). After washing the precipitates with 0.15 ml 
of ethanol, plates were dried and radioactivity was determined with scintillation 
fluid (Ultima Gold, Perkin Elmer) in a 1450 MicroBeta TriLux counter (Wallac). 

Cytotoxicity against eukaryotic cells. Human neutrophil granulocytes were 
freshly isolated from the blood of healthy volunteers by standard Histopaque/ 
Ficoll centrifugation. Lysis of neutrophil granulocytes was monitored by the release 
of the enzyme lactate dehydrogenase (LDH), as described in ref. 57. Lugdunin was 
added at final concentrations of 50, 25 and 12.5 1g ml! in 0.5% DMSO to wells 
of a 96-well tissue culturing plate containing 1 x 10° neutrophil granulocytes per 
well in 200 jul RPMI-1640 medium (2g 1! NaHCO, 10% foetal calf serum, 1% 
L-glutamine and 1% penicillin-streptomycin, PAN Biotech) without phenol red. 
The plates were incubated at 37 °C and 5% CO, for 3h and the lysis was determined 
with the LDH Cytotoxicity Detection Kit (Roche Applied Sciences, Mannheim, 
Germany). 
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Haemolytic activity was determined with human erythrocytes, freshly isolated 
from the blood of healthy volunteers by standard Histopaque/Ficoll centrifuga- 
tion. Lugdunin was added at final concentrations of 50, 25 and 12.5,.gml in 
0.5% DMSO to 2% erythrocytes in 1 x PBS. The cells were incubated for 1h and 
afterwards centrifuged for 10 min at 1,000 rpm. The supernatant was diluted 1:10 
in 1 x PBS, and the absorbance was measured at a wavelength of 540 nm. Asa 
positive control for neutrophil granulocyte or erythrocyte lysis, 2% Triton X-100 
was added to the samples. 

HL60 cells were cultured in RPMI-1640 medium. Their metabolic activity was 
monitored using the cell health indicator alamarBlue (Invitrogen) as described in 
ref. 58. Lugdunin was added at final concentrations of 50, 25 and 12.5gml1 in 
0.5% DMSO to wells of a 96-well tissue culturing plate containing 1 x 104 HL6O 
cells per well. The plates were incubated at 37 °C and 5% CO) for 24h. As a positive 
control for high cytotoxicity, staurosporine was added to the samples. The HL60 
cell line, which we have obtained from CNRS (Université Grenoble, France), and 
whose identity was confirmed by the provider, is not listed in the database as 
commonly misidentified. Mycoplasma assays were negative. 

Resistance development study. MIC assays for the antibiotics used in this study 
were performed as described above. We determined 1 x MICs of 0.01 ,.gml“! 
rifampicin and 1.51g ml! lugdunin against S. aureus USA300. Fresh MHB was 
inoculated 1:10,000 with an overnight culture of S. aureus USA300 LAC and was 
incubated at 37°C under continuous shaking. Cells were grown to early log phase, 
adjusted to 1 x 10° cellsml~!, and dispensed into 96-well microtiter plates with 
10011 per well. Lugdunin and rifampicin were added at concentrations of 0.25 x, 
0.5x, 1x, 1.5x,2x and 4x MIC. After 24h incubation at 37°C under continu- 
ous shaking, growth was determined with a microplate reader at an OD¢o0, and 
cells from the second highest concentration showing visible growth were used to 
inoculate the subsequent culture. 

Statistical analyses. Statistical analysis was performed by using GraphPad 
Prism (GraphPad Software, Inc., La Jolla, USA; version 5.04). Statistically sig- 
nificant differences were calculated by appropriate statistical methods as indi- 
cated. For the human study, risk of nasal colonization with S. aureus in the 
presence or absence of S. lugdunensis, as well as the respective point estimates 
of the risk ratio and confidence intervals, were determined using Stata version 
12.1 (Stat Corp., College Station, TX, USA). P values of <0.05 were considered 
significant. 

Animal models and ethics statement. All animal experiments were con- 
ducted in strict accordance with the German regulations of the Gesellschaft fiir 
Versuchstierkunde/Society for Laboratory Animal Science (GV-SOLAS) and the 
European Health Law of the Federation of Laboratory Animal Science Associations 
(FELASA) in accordance with German laws after approval (protocol HT 1/12 for 
mouse skin infection and T1/10 for cotton rat colonization) by the local authorities 
(Regierungsprasidium Tibingen). All, animal and human studies were carried out 
at the University Hospital Tubingen and conformed to institutional animal care and 
use policies. No randomization or blinding was necessary for the animal infection/ 
colonization models, and no samples were excluded. Animal studies were per- 
formed with female C57BL/6 mice, 6-8 weeks old, or cotton rats of both genders, 
8-10 weeks old, respectively. The human nasal colonization study was approved 
by the ethics committee of the medical faculty of the University Hospital Tubingen 
(project number 577/2015A). 

Skin infection of C57BL/6 mice. A streptomycin-resistant S. aureus Newman 
strain was used to infect C57BL/6 mice epicutaneously by the tape-stripping 
technique from ref. 59. Tryptic soy broth with 500 jg m1“! streptomycin was 
inoculated 1:10,000 with a fresh overnight culture of the test strain and was incu- 
bated at 37°C under continuous shaking until an ODg00 of 0.5 was reached. Cells 
were centrifuged, washed twice with 1 x PBS, and adjusted to 1 x 10° cells per 
ml. The integrity of the shaved skin of the mice was affected by repeated (seven 
times) vigorous tape stripping to enable S. aureus Newman infection. An inocu- 
lum of 15,11 from the bacterial suspension was added to 7-mm filter paper discs, 
placed onto the prepared skin with two discs per animal, and covered with Finn 
chambers on Scanpor tape (Smart Practise, Phoenix, AZ, USA). Finn chamber 
fixation occurred via Fixomull stretch plasters (BSN medical GmbH, Hamburg, 
Germany). After incubation for 24h, the Finn chambers were removed and 1.5 1g 
of lugdunin per colonized area was applied, followed by a second and third 
treatment with the same amount of lugdunin after 30h and 42h. Six hours after 
the final application, mice were euthanized, the skin was large-scale detached 
and 4-mm punches of the originally colonized areas were vortexed in 1 x PBS for 
30s to remove the attached bacteria from the skin (wash fraction). The skin was 
dissected with a scalpel to expose bacteria from deeper areas of the skin (tissue 
fraction), which was homogenized by vortexing in 1 x PBS for 30s. c.f.u. of both 


fractions were determined by serial dilutions in 1 x PBS, which were spotted 
onto tryptic soy agar, supplemented with streptomycin, for a streptomycin- 
resistant S. aureus Newman-specific selection. The plates were incubated 
overnight at 37°C. 

Generation of S. lugdunensis AlugD and complementation. For the construction 
of a marker-less knockout strain, 1-kb flanking regions of IugD were amplified by 
PCR with the primer pairs JugD upstream-SacI/lugD upstream-Acc65] and lugD 
downstream-Acc65I/lugD downstream-BglII (Extended Data Table 2). The frag- 
ments were digested according to their introduced restriction sites and were ligated 
into the plasmid pBASE6 generating pBASE6-AlugD, which was transferred to 
E. coli DC10B. The correct plasmid was transferred to S. aureus PS187 by electropo- 
ration, which was then infected with the bacteriophage ©187 for the transduction 
of pBASE6-AlugD into S. lugdunensis IVK28 wild type. The knockout was gener- 
ated by homologous recombination of the flanking regions into the genome, and 
deletion of IugD was confirmed by PCR. For the complementation of the mutant, 
lugD was amplified by the primer pair JugD complementation forward-PstI/lugD 
complementation reverse-Acc65I (Extended Data Table 2), digested with the 
appropriate restriction enzymes and ligated into identically digested pRB474. 
The resulting pRB474-lugD was transduced into S. lugdunensis IVK28 AlugD, as 
described for the knockout mutant. 

Competition assay. S. lugdunensis IVK28 wild type, S. lugdunensis IVK28 
AlugD, S. lugdunensis [VK28 AlugD::pRB474-lugD, and a streptomycin-resistant 
S. aureus Newman were grown in BM overnight at 37°C under continuous 
shaking. These strains were then adjusted to 1 x 10° c.f.u. per ml in 1 x PBS 
and diluted 1:10. For the starting condition of 90% S. aureus, equal volumes of 
1 x 10° S. aureus c.f.u. per ml and 1 x 108 S. lugdunensis c.f.u. per ml were mixed. 
Co-cultures with only 10% S. aureus were also performed, and 2011 of these 
mixtures were spotted in triplicate on BM agar and incubated at 37°C. Samples 
were taken at 0h, 24h, 48h, and 72h by scraping cells from the agar plates and 
suspending them in 1 x PBS. Serial dilutions of these samples were plated on 
BM and BM containing streptomycin for selection of S. aureus. After overnight 
incubation at 37°C, colony counts were determined, and the bacterial ratios of 
S. aureus and S. lugdunensis were calculated. 

Co-colonization of cotton rat noses. For the colonization of cotton rat noses, 
spontaneous streptomycin-resistant mutants of S. lugdunensis 1VK28 wild type 
and S. lugdunensis 1VK28 AlugD were selected on BM agar plates containing 
250,1g ml! streptomycin. Co-colonization was conducted with a streptomycin- 
resistant S. aureus Newman. The cotton rat model was described previously’. 
As the capacity of S. lugdunensis to colonize cotton rat nares has not been studied 
before, we determined the inoculum required for stable colonization by IVK28 wild 
type and its mutant AlugD over 5 days. Our previous studies have shown that for 
S. aureus an inoculum of 107 bacteria per nose results in a stable colonization of 
about 10° c.f.u. per nose (Extended Data Fig. 7a). To achieve a comparable coloni- 
zation level with S. lugdunensis, an inoculum of 10° bacteria per nose was required, 
and there was no detectable difference in colonization efficiency between wild type 
and AlugD (Extended Data Fig. 7b, c). Therefore, co-colonization experiments in 
cotton rat noses were performed with tenfold more S. lugdunensis than S. aureus 
to obtain a 1:1 colonization ratio. 

Cotton rats were anaesthetized and instilled intranasally with mixtures 

of either 1 x 10° S. lugdunensis wild type and 1 x 10’ S. aureus Newman or 
1 x 108 S. lugdunensis AlugD and 1 x 10’ S. aureus Newman. Five days after 
bacterial instillation, the animals were euthanized, and noses were surgically 
removed. The noses were heavily vortexed in 1 ml of 1 x PBS for 30s. Dilutions 
of the samples in PBS were plated on SSL agar containing 250g ml“! strep- 
tomycin to select for the used strains and to separate S. aureus (yellow) and 
S. lugdunensis (purple) by colour. The plates were incubated for 2 days under 
anaerobic conditions (anaerobic jar with Anaerocult A, MerckKGaA), for 
the specific detection of ornithine decarboxylase activity. S. aureus Newman 
c.f.u. were determined afterwards. All animals received drinking water with 
2.5mg ml! streptomycin continuously, starting 3 days before the experiment, 
to reduce the natural nasal flora. 
Human colonization study. A total of 187 nasal swab samples from hospitalized 
patients were received from the diagnostics laboratory of the Institute of Medical 
Microbiology and Hygiene (University Hospital Tiibingen, Germany). Dilutions 
from each sample were plated on blood agar and SSL agar for a phenotypic iden- 
tification of S. aureus and S. lugdunensis. Identity was confirmed by coagulase test 
and matrix-assisted laser desorption/ionization-time-of-flight mass spectrome- 
try (mass spectrometer: AXIMA Assurance, Shimadzu Europa GmbH, Duisburg, 
database: SARAMIS with 23.980 spectra and 3.380 superspectra, BioMérieux, 
Nuertingen). 
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Extended Data Figure 1 | Gene cluster of lugdunin and generation 4-phosphopantetheine cofactor. Encoded downstream is a putative 
of S. lugdunensis IVK28-Xy]l. a, The lugdunin genes are located ona monooxygenase (JugM). The transposon insertion site of Tn917, 
30-kbp operon. lugA-D encode the four NRPS, which are preceded by generating the lugdunin-deficient mutant IVK28 M1, is indicated by a red 
the putative regulator gene JugR. Encoded upstream are putative ABC arrow. b, The xylose-inducible lugdunin producer strain S. lugdunensis 
transporter genes (JugE-H) and genes for proteins with no described IVK28-xyl was generated by replacement of the regulator gene JugR by 
function (Jugl and lug/). A type-II thioesterase, that functions as a the xylAB promoter along with the xylR gene encoding a xylose-sensitive 
repair enzyme for stalled PCP domains, is encoded between lugC and repressor. The erythromycin resistance cassette ermB was integrated for 
lugD. The 4’-phosphopantetheinyl transferase LugZ converts inactive selection purposes. 


PCPs (apo-PCP) into the active holo-form by attachment of the 
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Extended Data Figure 2 | Structure elucidation by multistage tandem 
and electrospray ionization high-resolution mass spectrometry. 

a, A single-stage high-resolution MS/MS experiment revealed a 
superposition of fragment ions typically found for cyclic peptides. Quasi 
molecular ion selected for fragmentation is marked with a blue rhombus 
(m/z= 783.45). For sequence annotation, fragment ions were searched for 
b-ions of high intensity. One fragmentation route is annotated exemplarily 
by highlighting respective a-, b- and c-ion series signals. Thia, thiazoldine. 
b, Data generated by multistage tandem mass spectrometry show that 
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lugdunin mainly fragmented along four routes (blue digits; (I), (II), (II), 
(IV)). Initial protonation occurs at the secondary amine of thiazolidine 
(blue H atom). Proton transfer to nearby peptide bonds initiated ring 
cleavage with subsequent fragmentation. Initial loss of ammonia for 
fragmentation route (I) can be explained by a precedent six-membered 
transition state. Intensities are in arbitrary units. Blue rhombuses label the 
position of initial ring cleavage. Fragmentation route molecules are shown 
in linearized form. 
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Extended Data Figure 3 | NMR spectra of the natural product lugdunin. 
The two diastereomeric and interconvertible forms of lugdunin (see 

imine intermediate, Extended Data Fig. 4d) show distinctive sets of NUR 
signals with consistent patterns, which were assigned by two-dimensional 
NMR methods (COSY, HMBC, HSQC-DEPT, ROESY and TOCSY) and 
corroborated the thiazolidine heterocycle as a special feature of lugdunin. 
Chemical shifts (6) are shown in p.p.m. a, 'H NMR spectrum (600 MHz) 
of lugdunin in DMSO-d6 at 308 K. b, °C NMR spectrum (150 MHz) of 
lugdunin in DMSO-d6 at 308 K. Atom numbering refers to full spectral 
assignment shown in Extended Data Table 1. c, Expansion of HMBC 


spectrum (heteronuclear multiple bond correlation) shows distinct 
correlations between the amino acid a-proton and the carbonyl C-atom 
of the respective amino acid. d, Expansion of ROESY spectrum (rotation 
frame nuclear Overhauser effect spectroscopy) shows short-ranged 
correlations through space between a-protons and amino acid amide 
protons. e, Taken together, HMBC (green arrows) and ROESY (double 
headed blue arrows) correlations allowed for a full sequential walk along 
the peptide backbone, which readily confirmed the amino acid sequence 
of both lugdunin diastereomers (the sequential walk is exemplarily shown 
for one diastereomer). 
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Extended Data Figure 4 | The thiazolidine moiety of lugdunin and its 
formation by peptide cyclisation. a, Blue rhombus marks the b2-ion 
from fragmentation route I (Extended Data Fig. 2). The ion was selected 
from an in-source collision-induced decay (isCID) experiment and was 
further fragmented in order to find thiazolidine-specific fragment ions. 
High-resolution MS data are shown with annotated sum formulas and 
respective deviations from calculated masses (errors in p.p.m.). Shown 
fragment ions reveal the b1-ion at m/z = 170.0635 Da. Intensity is shown 
in arbitrary units. b, Photometric detection of thiazolidines at 670 nm. 

A positive colour reaction with 3-methyl-2-benzothiazolone hydrazone 
hydrochloride (Sawicki reagent) gives rise to detectable absorbance at 
670nm. Thiazolidine-4-carboxylic acid (thiazolidine) was chosen as 
positive control, whereas 2-methyl-2-thiazoline (thiazoline) and DMSO 
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acted as negative controls. c, Observable colours for the thiazolidine 
detection reaction. Thiazolidine-4-carboxlic acid (blue), lugdunin (green) 
and negative controls (yellow). d, The terminal reductase of LugC is 
proposed to initiate cleavage of the thioester-bound peptide chain with 
the aid of an NAD(P)H cofactor. The mature heptapeptide is liberated 
reductively from the NRPS multienzyme complex and cyclises via the 
N-terminal amine (L-Cys) and C-terminal aldehyde (L-Val) to form a 
macrocyclic imine/Schiff base. Subsequent nucleophilic attack of the 
cysteine thiol group generates the five-membered thiazolidine heterocycle. 
Nucleophilic attack by the L-cysteine sulfhydryl group may occur either 
at the re or si face of the imine thus leading to a diastereomeric mixture of 
two structural populations (depicted with wavy bond). LugC is shown as 
truncated gene, as indicated by the two lines (||). 
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Extended Data Figure 5 | Lugdunin activity against eukaryotic cells. 

a, b, Human neutrophil granulocytes (a) or erythrocytes (b) were 
incubated with high concentrations of lugdunin. Their lysis was monitored 
by the release of the enzyme lactate dehydrogenase (a) or haemoglobin (b), 
respectively. Cells without lugdunin were used as negative control. 
Incubation of cells in 2% Triton X-100 was used as positive control for 
high lysis. The data represent three independent experiments + s.d. 


c, Promyelocytic leukaemia (HL60) cells were incubated with high 
concentrations of lugdunin. In order to determine the 50% inhibitory 
concentration of lugdunin, the metabolic activity of HL60 cells was 
measured by the conversion of resazurin into the highly fluorescent 
resorufin. The apoptosis-inducing staurosporine was used as positive 
control. ICs9 values were calculated from the means of three independent 
experiments. 
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Extended Data Figure 6 | Time course of incorporation of tritium- 
labelled metabolic precursors into lugdunin-treated B. subtilis. 

B. subtilis 168 (trpC2) is a widely used model organism for mode of action 
investigations and was also used for orienting studies on the mechanism of 
lugdunin. Susceptibility of B. subtilis and S. aureus to lugdunin is similar 
(Table 1), with an MIC for B. subtilis of 441g ml in the Belitzky minimal 
medium used in this assay. a-d, Incorporation of thymidine into DNA (a), 
uridine into RNA (b), leucine into protein (c) or N-acetylglucosamine into 
peptidoglycan (d) ceased within minutes during lugdunin treatment. The 
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experiment was repeated on three days with three independent bacterial 
cultures. One representative experiment is shown. At a concentration of 
half the MIC (1/2 MIC), incorporation of precursors into all pathways 
ceased reproducibly. At 1/8 MIC, incorporation continued repeatedly in 
parallel with the untreated control. At 1/4 MIC, the depicted experiment 
shows protein and peptidoglycan syntheses slightly more impaired than 
DNA and RNA syntheses, whereas the opposite occurred in another 
experiment. In summary, all four metabolic pathways seem to be equally 
impaired by lugdunin. 
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Extended Data Figure 7 | Nasal colonization rates by S. aureus and efficiency to colonize the noses of cotton rats (5 or 6 animals per group). 
S. lugdunensis in cotton rats. a—c, Different inocula of S. aureus c.f.u. of each strain were determined per nose after 5 days and plotted as 
Newman (a), S. lugdunensis IVK 28 wild type (b) and S. lugdunensis individual dots. Lines represent the median of each group. 


IVK 28 AlugD (c) were instilled intranasally to determine their 
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Extended Data Table 1 | NMR spectral assignment of lugdunin diastereomers 


ARTICLE 


Diastereomer | 


Diastereomer Il 


C-Atom be 5u (mult., J in Hz) be 5 (mult., J in Hz) C-Atom 
1 L-Cys-Thiazolidine 
2-NH - 2.88 (1H, m) - n.d. 2-NH 
1 38.2 2.58 (1H, m) 37.5 2.54 (1H, m) 1 
3.06 (1H, dd, 9.7, 6.4) 3.17 (1H, dd, 9.2, 6.0) 
2 63.3 3.89 (1H, m) 65.3 3.69 (1H, m) 2 
3 169.8 - 171.5 - 3 
2 p-Val 
4-NH - 8.16 (1H, d, 9.8) - 8.52 (1H, d, 8.6) 4-NH 
4 57.2 4.18 (1H, dd, 10.0, 8.4) 60.5 3.70 (1H, m) 4 
5 1.65 (1H, m) 28.5 1.71 (1H, m) 5 
G7 18.6, 18.0 0.47 (3H, d, 6.6), 0.63 (3H, d, 6.6) 18.5, 19.3 0.52 (3H, d, 6.8), 0.84 (3H, m)* G7 
8 169.7 - 170.5 - 8 
3L-Trp 
9-NH - 8.22 (1H, d, 8.5) - 8.52 (1H, d, 7.4) 9-NH 
9 52.7 4.70 (1H, ddd, m) 53.4 4.52 (1H, m) 9 
10 28.0 2.89 (1H, m) 26.4 2.82 (1H, dd, 14.7, 4), 10 
3.01 (1H, dd, 14.0, 5.5) 3.36 (1H, dd, 14.7, 4) 
11 109.5 - 110.2 - 11 
12 127.1 - 126.8 - 12 
13 118.2 7.66 (1H, d, 7.9) 1179 7.51 (1H, d, 7-9) 13 
14 117.9 6.95 (1H, ddd, 7.9, 7.8, 1.1) 118.1 6.96 (1H, ddd, 7.9, 7.9, 1.0) 14 
15 120.5 7.02 (1H, ddd, 7.9, 7.8, 1.1) 120.7 7.04 (1H, ddd, 8.0, 7.9, 1.0) 15. 
16 110.9 7.28 (1H, dd, 7.9, 1.0) 111.1 7.30 (1H, dd, 8.0, 1.0) 16 
1¥ 136.0 - 136.1 - ci 
18-NH - 10.77 (1H, d, 2.0) - 10.80 (1H, d. 2.0) 18-NH 
18 124.1 7.15 (1H, d, 2.0) 123.6 7.15 (1H, d, 2.0) 18 
19 170.8 - 170.1 - 19 
4D-Leu 
20-NH - 8.14 (1H, d, 8.7) 7.78 (1H, d, 9.4) 20-NH 
20 50.7 4.46 (1H, ddm) 52.2 4.41 (1H, m) 20 
21 41.3 1.32 (1H, m), 1.40 (1H, m) 41.1 1.30 (1H, m), 1.37 (1H, m) 21 
22 24.1 1.28 (1H, m) 23.9 1.24 (1H, m) 22 
23, 24 19.1, 19.3 0.84 (3H, m), 0.87 (3H, m) 20.8*, 21.6* 0.79 (3H, m), 0.77 (3H, m) 23, 24 
25 171.6 - 1714 - 25 
5L-Val 
26-NH - 7.99 (1H, d, 9.2) - 7.69 (1H, d, 9.3) 26-NH 
26 56.9 4.39 (1H, m) 57.4 4.24 (1H, dd, 9.2, 9.2) 26 
27 31.2 1.96 (1H, m) 29.7 1.89 (1H, m) 27 
28, 29 19.1,17.9 0.80 (3H, m), 0.83 (3H, m) 19.1, 19.3 0.85 (3H, m)*, 0.84 (3H, m)* 28, 29 
30 170.2 - 171.0 - 30 
6 D-Val 
31-NH - 0.77 (1H, 4, 6.7) - 8.02 (1H, d, 7.8) 31-NH 
31 56.5 4.50 (1H, dd, 8.8, 4.6) 59.1 4.04 (1H, dd, 7.8, 7.8) 31 
32 31.9 1.97 (1H, m) 29.0 1.98 (1H, m) 32 
33, 34 17.4, 19.1 0.83 (3H, d, 6.5), 0.88 (3H, d, 6.5) 18.3, 19.1 0.88 (3H, m), 0.84 (3H, m) 33, 34 
35 170.4 - 1714 - 35 
7 L-Val-Thiazolidine 
36-NH - 7.74 (1H, d, 9.2) - 7.43 (1H, d, 9.3) 36-NH 
36 57.2 3.70 (1H, m) 54.4 4.04 (1H, dm, 9.3) 36 
37 29.2 1.99 (1H, m) 32.5 1.55 (1H, m) a7 
38, 39 15.2,22.3 0.75 (3H, d, 6.9), 0.79 (3H, d, 6.9) 19.8,19.7 0.86 (3H, d, 6.7), 0.92 (3H, d, 6.7) 38, 39 
40 72.0 4.60 (1H, dd, 10.6, 8.8) 72.1 4.73 (1H, dd, 13.1, 2.0) 40 


Chemical shifts (6) are given in p.p.m. Mult., multiplicity. Coupling constants (J) are given in Hertz (Hz). Asterisks (*) mark strong signal overlap. 
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Extended Data Table 2 | Primers used in this study 


Primer 


Tn917 up 

Tn917 down 

SIPri-up 

SIPr1-down 

SIPr2-up 

SIPr2-down 

lugD upstream-Sacl 
lugD upstream-Acc65l 
lugD downstream-Acc65l 
lugD downstream-Bglll 
lugD comp. forw-Pstl 


lugD comp. rev-Acc65l 


Primer sequence 5‘-3‘ Gene annotation 
in S. lugdunensis 
N920143 
ATAGGCCTTGAAACATTGGTTTAGTGGG --- 
CCCATAGATAAGAAATACACCTGCAATAACC --- 
TACGGTACCCGCTTAACAAGAT GACTAGC SLUG_RS03935 
TCTTTATGGTACCTATTACATCTCTAAAG SLUG_ RS03935 
ATTTGTATTGATATCATAAAAAATGTCCG SLUG_ RS03935 
GTTAGATCTAAAGGAGGTCAATCAGATGG SLUG_ RS03935 
TAGGAGCTCGCTTAATGAATTC SLUG_RS03965 
ATAGGTACCCTCCTTCTAGCTAAGC SLUG_RS03965 
AGTGGTACCCTCTATTAACTGAAAGG SLUG_RS03965 
ATTAGATCTGAAGTTAAGCATCCGTC SLUG_RS03965 
ATACTGCAGGCTTAGCTAGAAGGAGAG SLUG_RS03965 


AATGGTACCCATCAGCATTATAGTT SLUG_RS03965 


Application for 


Sequencing of transposon insertion site 

Sequencing of transposon insertion site 

Replacement of /ugR by xy/AP promoter and xy/R regulator gene 
Replacement of /ugR by xy/AP promoter and xylR regulator gene 
Replacement of /ugR by xy/AP promoter and xy/R regulator gene 
Replacement of /ugR by xy/AP promoter and xy/R regulator gene 
S. lugdunensis AlugD 

S. lugdunensis AlugD 

S. lugdunensis AlugD 

S. lugdunensis AlugD 

Complementation of S. /ugdunensis AlugD 


Complementation of S. /ugdunensis AlugD 
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Structural basis of Smoothened 
regulation by its extracellular domains 


Eamon F. X. Byrne!*, Ria Sircar?*, Paul S. Miller!, George Hedger’, Giovanni Luchetti*, Sigrid Nachtergaele’, Mark D. Tully’, 
Laurel Mydock-McGrane?®, Douglas F. Covey®, Robert P. Rambo’, Mark S. P. Sansom*, Simon Newstead**, Rajat Rohatgi? 


& Christian Siebold! 


Developmental signals of the Hedgehog (Hh) and Wnt families are transduced across the membrane by Frizzled- 
class G-protein-coupled receptors (GPCRs) composed of both a heptahelical transmembrane domain (TMD) and an 
extracellular cysteine-rich domain (CRD). How the large extracellular domains of GPCRs regulate signalling by the TMD is 
unknown. We present crystal structures of the Hh signal transducer and oncoprotein Smoothened, a GPCR that contains 
two distinct ligand-binding sites: one in its TMD and one in the CRD. The CRD is stacked atop the TMD, separated by an 
intervening wedge-like linker domain. Structure-guided mutations show that the interface between the CRD, linker 
domain and TMD stabilizes the inactive state of Smoothened. Unexpectedly, we find a cholesterol molecule bound to 
Smoothened in the CRD binding site. Mutations predicted to prevent cholesterol binding impair the ability of Smoothened 
to transmit native Hh signals. Binding of a clinically used antagonist, vismodegib, to the TMD induces a conformational 
change that is propagated to the CRD, resulting in loss of cholesterol from the CRD-linker domain-TMD interface. Our 
results clarify the structural mechanism by which the activity of a GPCR is controlled by ligand-regulated interactions 


between its extracellular and transmembrane domains. 


The extracellular region of Smoothened (SMO) is composed of an 
N-terminal CRD followed by a small linker domain, which then 
connects to the TMD and a C-terminal intracellular domain (ICD; 
Fig. 1a). Studies using small-molecule agonists and antagonists of 
SMO have defined two separable ligand-binding sites, one in the TMD 
and one in the CRD!. The TMD binding site binds the plant-derived 
inhibitor cyclopamine”’, the synthetic agonist SAG*®, and the anti- 
cancer drug vismodegib®, which is used clinically to treat advanced 
basal cell cancer. Side-chain oxysterols such as 20(S)-hydroxycholesterol 
(20(S)-OHC) represent a distinct class of SMO ligands’~® that activate 
signalling by engaging a hydrophobic groove on the surface of the SMO 
CRD!°-!?, The native morphogen Sonic Hedgehog (SHH) functions by 
binding and inactivating Patched 1 (PTCH1), the major receptor for 
Hh ligands, which suppresses SMO activity'’. Despite the discovery 
of numerous exogenous SMO ligands, no bona fide endogenous SMO 
ligand that regulates Hh signalling has been identified. Structure- 
guided mutations that disrupt 20(S)-OHC binding to the CRD groove 
or sterol-based inhibitors that occlude this groove impair signalling by 
SHH!!!, By contrast, several mutations in the TMD site that blocked 
the binding and activity of synthetic ligands failed to have any effect on 
the basal or SHH-stimulated activity of SMO’*"*. These data suggest 
that an endogenous SMO ligand that can regulate Hh signalling engages 
the CRD groove on SMO. 

Crystal structures of the isolated SMO linker domain-TMD in 
complex with both agonist and antagonist ligands'*"!” have shown 
that the GPCR heptahelical scaffold is conserved and provided a 
detailed view of a small-molecule binding pocket, but did not show the 
conformational changes typically associated with GPCR signalling!*””. 
In addition, two unliganded structures of the isolated SMO CRD have 
been solved!°”°, However, structural insights into how the extracellular 


domains and TMD interact to regulate signalling in SMO (or in any 
other GPCR) are lacking. 


Overall structure of SMO 

We determined the crystal structure of human SMO containing both 
the CRD and the TMD, connected by the juxta-membrane linker 
domain (SMOAC, Fig. 1a and Extended Data Fig. 1). To study the 
SMO TMD in a defined functional state and to reduce conformational 
flexibility, we included a single amino acid mutation, Val329Phe!®, in 
TMD helix 3 that locked SMO in an inactive state and substantially 
improved expression levels (Extended Data Fig. 2 and Supplementary 
Discussion). Using an established strategy in GPCR crystallography, 
the third intracellular loop (ICL3) between transmembrane helices 
5 and 6 was replaced by thermostabilized apocytochrome b562RIL 
(BRIL)?!. The SMOAC structure was determined to 3.2 A resolution 
(Extended Data Table 1 and Extended Data Fig. 3). The asymmetric 
unit, comprising two molecules arranged ‘head-to-tail, stacks into 
alternating hydrophobic and hydrophilic layers along one axis, as 
is typical for lipidic cubic phase (LCP)-derived crystals (Extended 
Data Fig. 3a). This SMO arrangement within the crystal suggests that 
SMOAC is monomeric, in agreement with size-exclusion chromato- 
graphy (SEC) coupled to multi-angle light-scattering analysis (MALS) 
(Extended Data Fig. 3f). 

SMO adopts an extended conformation in the structure. The 
extracellular CRD is perched on top of the linker domain, which forms 
a wedge between the TMD and CRD. At the apex of this wedge, the 
CRD contacts the TMD through the elongated TMD extracellular 
loop 3 (ECL3; Fig. la). The overall architecture is stabilized by nine 
disulfide bridges, four of which (numbered 2-5 in Fig. 1a) reveal the 
canonical disulfide pattern of the CRD fold”? and one that is specific 


1Division of Structural Biology, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK. 2Departments of Biochemistry and Medicine, Stanford University School 
of Medicine, Stanford, California, 94305, USA. ?Department of Biochemistry, University of Oxford, Oxford OX1 3QU, UK. “Diamond Light Source Ltd, Harwell Science & Innovation Campus, Didcot 
OX11 ODE, UK. Department of Developmental Biology, Washington University School of Medicine, St. Louis, Missouri, 63110, USA. 
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Figure 1 | Structure of human SMO. a, Two views of the overall structure 
showing the extracellular and transmembrane domains of human SMO 
in cartoon representation. Orange, CRD; pink, linker domain (LD); blue, 
TMD; red, inactivating point mutation Val329Phe; cyan, cholesterol; 
black, nine numbered disulfide bridges; yellow sticks, two N-linked 
glycans (NAG). A schematic of SMO is shown above (SP, signal peptide; 
BRIL, position of the BRIL fusion protein inserted between TMD helices 
5 and 6). b, The ‘connector’ region between the CRD and linker domain 
highlighted as sticks in atomic colouring, with the CRD shown as a 
solvent-accessible surface and the linker domain and part of the TMD 
ECL3 loop as cartoons. c, Interface between the CRD, linker domain and 
TMD shown in cartoon representation. Yellow sticks, ECL3-NAG; cyan 
sticks, cholesterol. 


for SMO" (marked as 1’ in Fig. 1a). This disulfide bridge positions the 
start of a ‘connector’ segment that links the CRD and linker domain 
(Fig. 1b). The connector is tucked along a hydrophobic groove that 
runs the length of the CRD and shields this groove with three inwardly 
turned hydrophobic residues (Val182, Ile185 and Phe187), multiple 
hydrophilic residues and an N-linked glycan facing the solvent (Fig. 1b 
and Extended Data Fig. 4a). The connector region, combined with 
mainly hydrophobic interactions between the CRD and both the linker 
domain and ECL3 (total buried surface area of 745 A’), orients the CRD 
in an upright conformation with its N terminus pointing away from the 
plasma membrane (Fig. 1a, b and Extended Data Fig. 4b). However, 
we did not observe major structural changes in the heptahelical TMD 
bundle when we compared it to previously solved structures of either 
antagonist- or agonist-bound complexes lacking the CRD!*"!”. The 
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only exception was a rearrangement of the linker domain, which in our 
structure is pushed down towards the TMD, perhaps by CRD binding 
(Extended Data Fig. 5). A structure of the activated state of the SMO 
TMD will probably require co-crystallization with its (still unknown) 
downstream effector or the use of an active-state stabilizing antibody. 


Cholesterol is a ligand for SMO 

Unexpectedly, we discovered a cholesterol molecule in our SMO 
structure (Fig. 1 and Extended Data Methods). Cholesterol occupies 
a central position, interacting with all three SMO domains (the CRD, 
linker domain and TMD; Figs 1c and 2a, b), and adopts an extended 
conformation with its tetracyclic sterol ring bound in a shallow groove 
in the CRD, a site previously shown!?-!? to bind 20(S)-OHC in SMO 
and the palmitoleyl group of Wnt ligands in Frizzled receptors”. The 
cholesterol iso-octy] tail, located at the interface between the CRD, 
linker domain and TMD ECL3, is buried in the SMO protein core. 
This arrangement positions cholesterol some 12 A away from the 
lipid bilayer of the plasma membrane (Fig. 2c) and indicates that a 
cholesterol molecule would have to completely desorb from the 
plasma membrane surface to access its binding site in SMO. Cellular 
cholesterol levels are permissive for Hh signalling”**°; however, this 
requirement is likely to be unrelated to the cholesterol binding site seen 
in our structure because SMO mutants lacking the CRD” or carrying 
mutations in the cholesterol-binding groove’ remain sensitive to 
cholesterol depletion (Supplementary Discussion). 

The sterol-binding site of SMO is predominantly lined with hydro- 
phobic residues from the CRD, which stabilize the flat a-face of 
cholesterol (Fig. 2a). Mutations in several of these residues (Leu108, 
Trp109, Prol64 and Phe166) have been noted to prevent SMO binding 
to 20(S)-OHC and to impair signalling driven by either 20(S)-OHC 
or SHH'*"”. The §-face is shielded by the side-chain of Arg161 and 
an N-linked glycan (Fig. 2b). The cholesterol 33-hydroxyl group is 


Figure 2 | The cholesterol binding site. a, b, Close-up of cholesterol with 
interacting residues as sticks. Initial 2F,-F. map at 1.0c before inclusion 
of cholesterol shown as chicken-wire. Colour-coding follows Fig. 1. 

Inset shows the potential hydrogen-bonding network coordinating the 
cholesterol 38-hydroxyl group. Interatomic distances (A) are shown in 
black. c, Solvent-accessible surface colour-coded by hydrophobicity (red, 
hydrophobic; white, hydrophilic). d, Sequence conservation (based on 

55 vertebrate SMO sequences) mapped onto SMOAC (black, conserved; 
white, not conserved). e, Superposition of human (orange, this study) 
and fly (purple, PDB 2MAH”°) SMO CRD structures. The fly CRD region 
occupying the cholesterol-binding site is highlighted by the dashed line. 
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incorporated in a hydrogen-bonding network with the side chains 
of Asp95, Trp109 and Tyr130; this optimally positions the «-face of 
the sterol ring system to make a stacking interaction with the indole 
ring of Trp109 (Fig. 2a, b). The Leu491, Ala492 and Ile496 residues 
of TMD ECL3 and the Val210 residue of the linker domain orient the 
iso-octy] tail of cholesterol in an elongated conformation. Residues 
lining the cholesterol-binding site are highly conserved in vertebrates 
but are less conserved in Drosophila (Fig. 2d and Extended Data Fig. 1). 
The superposition of our human CRD structure with a previously 
solved solution structure of the Drosophila CRD” (r.m.s.d. 1.71 A for 86 
equivalent Ca positions) revealed a major rearrangement of one edge 
of the cholesterol-binding site comprising fly SMO residues 183-190 
(Fig. 2e and Extended Data Fig. 1). This segment of fly SMO forms 
a short helix that protrudes into the CRD groove and consequently 
may preclude cholesterol binding. In fact, fly SMO does not bind 
oxysterols!°"1. 

Consistent with the structure, purified SMOAC bound to cholesterol 
in a ligand-affinity assay (Fig. 3a—c), analogous to the assay we 
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Figure 3 | The cholesterol-binding site regulates SMO signalling 
activity. a, Purified SMOAC captured on beads coupled to increasing 
concentrations of cholesterol in the presence or absence of free 20(S)- 
OHC. b, c, SMOAC captured on cholesterol beads in the presence of 
increasing concentrations of free 20(S)-OHC (b) or in the presence of 

a synthetic epimer 20(R)-OHC (c). d, Protein levels of SMO variants 
stably expressed in Smo~'~ mouse fibroblasts. WT, wild-type. The protein 
Suppressor of Fused (SUFU) served as a loading control. e, Binding of 
SMO variants to 20(S)-OHC-coupled beads. Asp99Ala and Tyr134Phe are 
predicted to disrupt hydrogen bonding with cholesterol (Fig. 2a). f, Levels 
of Glil mRNA (mean arbitrary units + s.d., n = 4) were used as a metric for 
Hh signalling activity in cell lines shown in d after stimulation with SHH, 
SAG or 20(S)-OHC-. Statistical significance based on one-way ANOVA is 
denoted for the difference in Glil mRNA levels between cells expressing 
wild-type SMO and cells expressing each mutant SMO protein. n.s., 
P>0.05; ****P < 0.0001. Each experiment was repeated 3 or more times. 


ARTICLE 


previously developed to measure the binding of SMO to 20(S)-OHC’. 
Beads covalently coupled to cholesterol captured purified SMOAC 
(Fig. 3a). Binding could be blocked by free 20(S)-OHC added as a 
competitor at concentrations that activate Hh signalling in cells” 
(Fig. 3b), confirming that cholesterol and 20(S)-OHC engaged the 
same binding groove on the CRD surface. In a stringent specificity 
control, 20(R)-OHC, an epimer with inverted stereochemistry at a 
single position that cannot bind to the CRD’ and cannot activate Hh 
signalling”, failed to block this interaction (Fig. 3c). 


Cholesterol promotes SMO signalling 

To assess the functional relevance of the cholesterol-binding site 
identified in our structure, we focused our mutagenesis efforts on 
the hydrogen-bonding network between the cholesterol 36-hydroxyl 
group and Asp95 and Tyr130 (Fig. 2a). The corresponding residues 
in mouse SMO, Asp99 and Tyr134, were mutated either individually 
or in combination to residues (Asp99Ala and Tyr134Phe) that lack 
hydrogen bond acceptor or donor groups. As cholesterol and oxysterols 
occupy the same groove in the CRD (Fig. 3b), they probably adopt 
a similar conformation and participate in at least some of the same 
interactions. Therefore, we used 20(S)-OHC binding and signalling 
to evaluate the effect of these mutations. First, we tested the ability of 
20(S)-OHC beads to capture SMO from detergent extracts (Fig. 3d, e). 
As previously demonstrated’, wild-type SMO can be captured on 
20(S)-OHC beads. However, SMO variants carrying the Asp99Ala 
and Tyr134Phe point mutations failed to bind to 20(S)-OHC beads, 
highlighting the importance of this hydrogen-bonding network for 
sterol binding (Fig. 3e). Next, we stably expressed untagged versions of 
these SMO mutants in Smo~/~ mouse fibroblasts (Fig. 3d) and assessed 
their abilities to restore signalling initiated by SHH, the TMD agonist 
SAG or the CRD agonist 20(S)-OHC (Fig. 3f). SMO-Asp99Ala and 
SMO-Tyr134Phe did not increase the basal activity of SMO in the 
absence of Hh agonists and did not significantly change the signalling 
response to SAG (Fig. 3f). This demonstrates that these mutants 
retained an intact TMD ligand-binding site and remained competent 
to transmit signals to cytoplasmic components. However, Asp99Ala 
and Tyr134Phe significantly impaired the ability of SMO to respond to 
both 20(S)-OHC and SHH (Fig. 3f). We conclude that these residues, 
and by implication the cholesterol seen in our structure, are important 
for signal-induced SMO activation. 


Inactive-state stabilization by the SMO CRD 

To assess the structural influence of the CRD in a membrane 
environment, we carried out molecular dynamics simulations of 
SMO in the presence and absence of cholesterol. Ten independent 
100-ns atomistic simulations were performed with SMO embedded 
in a phosphatidylcholine bilayer (Fig. 4a, b and Extended Data Fig. 6). 
These simulations revealed that the SMO CRD has substantial 
conformational flexibility when not bound to a ligand. In the presence 
of cholesterol, however, there was a pronounced decrease in this 
flexibility (Fig. 4a), consistent with the idea that cholesterol stabilizes 
the CRD structure. By contrast, cholesterol did not substantially 
change the conformational stability of the SMO TMD (Fig. 4b). The 
predominant cholesterol-induced stabilization was seen in the vicinity 
of the sterol-binding pocket, with some propagation to more distal 
CRD regions (Extended Data Fig. 6). Consistent with these simulations, 
the thermostability of purified SMOAC was reduced when cholesterol 
was depleted with methyl-3-cyclodextrin (Extended Data Fig. 6f, g and 
Supplementary Discussion). 

The molecular dynamics results prompted us to test the signalling 
consequences of destabilizing interactions within the extracellu- 
lar region of SMO. To probe the effect of interactions between the 
CRD and the linker domain, we generated two mutants (Pro120Ser 
and Ile160Asn/Glu162Thr) in the context of mouse SMO. These 
introduce bulky, N-linked glycosylation sites at positions 114 and 
156, respectively, of the human CRD that contact the linker domain 
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or antagonists (cyclopamine, SANT-1). Glil mRNA levels in untreated 
cells reflect constitutive activity of SMO. Numbers above bars indicate 
Glil fold-increase compared to untreated cells. Asterisks denote statistical 
significance based on one-way ANOVA for comparison with wild-type 
SMO. **P< 0.01, ****P < 0.0001. e, Solution-state SAXS profiles of 
untreated (grey) and 20(S)-OHC-treated (red) SMOAC. Lines are fits 
derived from the indirect Fourier transform of the shown data points. 
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(Fig. 4c). To disrupt the conformation of the linker domain itself, we 
mutated two cysteines (Cys197 and Cys217 in mouse SMO) that form a 
disulfide bond within the linker domain (numbered 6 in Figs 1a and 4c) 
to serines. We compared these mutants to SMO lacking the entire CRD 
(SMO-ACRD) (Fig. 4d and Extended Data Fig. 7a). Untagged SMO- 
ACRD stably expressed in Smo~'~ fibroblasts demonstrated a similar 
level of constitutive activity'’!* to that of wild-type SMO stimulated 
with saturating SHH (Fig. 4d and Extended Data Fig. 7b). 

The Pro120Ser, Ile160Asn/Glu162Thr, and Cys197Ser/Cys217Ser 
mutations mimicked the effect of complete CRD deletion—constitutive 
signalling activity was increased (Fig. 4d) despite the presence of high 
levels of PTCH1 (Extended Data Fig. 7a) and there was complete loss 
of responsiveness to 20(S)-yne (Fig. 4d). All three mutants remained 
sensitive to inhibition by SANT-1, which binds deep in the TMD and 
does not contact the linker domain'®. Cyclopamine and SAG, both of 
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which make contacts with the linker domain'®!”, could regulate SMO- 
Pro120Ser and SMO-Ile160Asn/Glu162Thr, but not SMO-Cys197Ser/ 
Cys217Ser. Thus, destabilization of either CRD-linker domain contacts 
or the linker domain itself increases the constitutive activity of SMO, 
implicating these regions in stabilizing an inactive state, analogous to 
the D(E)R*°Y motif in helix III of some Class A GPCRs”. 

To understand the basis for the loss of oxysterol responsiveness, 
we measured binding of each of the mutants to 20(S)-OHC beads 
(Extended Data Fig. 7c). Although the CRD mutations Pro120Ser and 
Tle160Asn/Glu162Thr impaired binding, the Cys197Ser/Cys217Ser 
mutation in the linker domain had no effect on binding to 20(S)-OHC. 
The observation that 20(S)-OHC cannot activate SMO-Cys197Ser/ 
Cys217Ser (Fig. 4d) even though it can bind normally (Extended Data 
Fig. 7c) suggests that the linker domain may transmit the conforma- 
tional changes that lead to SMO TMD activation in response to CRD 
ligands. To investigate conformational changes in SMO induced by 
CRD ligands in the solution state, we performed small angle X-ray 
scattering (SAXS) experiments on SMOAC in the absence of any 
exogenous ligands (apo-SMOAC) or SMOAC loaded with the agonist 
20(S)-OHC (Extended Data Fig. 8a). The initial SAXS curves (Fig. 4e) 
and further analyses””® (Fig. 4f and Extended Data Fig. 8b) show that 
20(S)-OHC binding induces a conformational change consistent with 
elongation and reduced globularity of SMOAC. As 20(S)-OHC and 
cholesterol bind to the same groove on the CRD (Fig. 3b), we conclude 
that replacement of cholesterol by 20(S)-OHC (a molecule that carries 
only a single additional hydroxyl) produces a conformational change 
that leads to SMO activation. 


The structure of SMO bound to vismodegib 

Our structural and functional studies highlighted the critical regulatory 
role played by CRD-linker domain-TMD contacts in controlling 
the conformation and activity of SMO. It is unclear whether these 
interactions are altered by TMD-targeted small molecules used to 
treat patients with Hh-driven cancers, because prior structures of SMO 
bound to small molecules have not included the CRD, precluding an 
assessment of how the extracellular and transmembrane domains of 
SMO communicate. We determined the crystal structure of SMO in 
complex with vismodegib, a potent TMD antagonist, to 3.3 A resolution 
(vismo-SMOAG; Fig. 5a and Extended Data Fig. 9a, b). Despite the 
fact that vismodegib is the most commonly used Hh pathway inhibitor 
in patients and resistance has already become a clinically relevant 
problem!*??°, there are no structures available for vismodegib in 
complex with SMO. 

Vismodegib is stabilized by a network of hydrogen bonds and 
hydrophobic interactions formed by the SMO TMD core (Fig. 5b). 
As expected, vismodegib occupies the TMD binding site previously 
identified in SMO complexes with other antagonists and agonists'°~'” 
(Extended Data Fig. 5). The vismodegib pyrimidine and chloro-benzyl 
rings are deeply buried, forming mainly hydrophobic interactions with 
the TMD core. The amide linker, which is stabilized by a potential 
hydrogen bond to the side chain of Asp384, occupies a central position 
within the seven transmembrane domain (7TM) bundle and connects 
to the chlorophenyl-methylsulfone moiety, which is oriented towards 
the extracellular domains and the entrance of the TMD pocket!®. This 
arrangement is stabilized by potential hydrogen bonds to the side 
chains of Gln477 and Arg400 and a hydrophobic stacking interaction 
of the vismodegib methylsulfone moiety and the aromatic ring of 
Phe484. Asp473, a residue that is mutated in vismodegib-resistant 
cancers'*?9°, stabilizes the potential hydrogen-bonding network 
around Arg400 (asterisk in Fig. 5b). The SMO-Asp473His mutation 
seen in drug-resistant cancers is ideally positioned to disrupt the 
vismodegib-binding site, in agreement with functional experiments 
showing that SMO-Asp473His cannot bind this drug’. Our structure 
also explains the vismodegib resistance mechanism for many other 
mutations in the TMD binding site observed in patients with advanced 
basal cell cancer??*° (Extended Data Fig. 9c-f). 
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Figure 5 | Structure of SMO in complex with the antagonist vismodegib. 
a, Overall structure showing full extracellular and transmembrane 
domains of human SMO (cartoon) in complex with vismodegib (green). 
Colour-coding follows Fig. 1. b, Close-up of the vismodegib-binding 

site. The asterisk denotes a residue mutated in vismodegib-resistant 
tumours. c-e, Comparison of apo-SMO (red) and vismo-SMO (blue). 

c, Superposition of the two SMO structures based on the TMD. Arrows 
indicate domain rotations. Cholesterol (red) and vismodegib (blue) shown 
as spheres. The dashed oval highlights the conformational change of the 
TM6 and ECL3. d, Close-up of the TMD-linker domain-CRD interface. 
Phe484 contacts the vismodegib methylsulfone group. e, Close-up of the 
sterol-binding site. Cholesterol, Arg161 and NAG shown as sticks. The 
partly disordered ECL3 loop of vismo-SMO is depicted as dotted line. 


The vismodegib-bound structure of SMO in its inactive state shows 
a marked conformational change in the extracellular domains and 
the ECL3 loop when compared to the apo-SMO structure (Fig. 5c). 
The upper part of transmembrane helix 6, which contacts the CRD 
and linker domain, is moved ~15° towards the linker domain and 
CRD, probably owing to an interaction between Phe484 and the 
methylsulfone moiety of vismodegib (Fig. 5b, d). This movement of 
helix 6 results in a rotational movement of the CRD (and to a lesser 
extent the linker domain), allowing ECL3 to intrude into the sterol- 
binding groove of the CRD. As a consequence, the side chain of CRD 
residue Arg161 forms a stacking interaction with Trp109 and occupies 
the space where cholesterol was located in the apo-SMO structure 
(Fig. 5e). This reorganization is predicted to occlude the cholesterol- 
binding site. Indeed, cholesterol was absent from the vismo-SMO 
structure (Fig. 5a), and the addition of vismodegib reduced the 
interaction of purified SMOAC with cholesterol beads (Extended Data 
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Fig. 9g, h). In summary, binding of vismodegib to the TMD site induces 
a conformational change that ultimately results in rearrangement of the 
sterol-binding site, providing a structural communication mechanism 
that explains the previously observed allosteric interaction’ between 
CRD and TMD ligands. More generally, this finding suggests that 
a similar conformational change, involving a shift of the ECL3 and 
a pivoting of the CRD on the extracellular end of the TMD bundle, 
may allow an extracellular signal to be transduced from the CRD to 
the TMD and ultimately across the membrane. From a therapeutic 
perspective, our results highlight an unanticipated role for the CRD and 
ECL3, including displacement of an extracellular cholesterol ligand, in 
stabilizing a drug-bound inactive state of SMO, providing a structural 
template for the development of the next generation of SMO inhibitors 
against Hh-driven cancers. 


Conclusion 

The structure of SMO provides insights into the mechanism by which 
a large extracellular region and two allosterically linked ligand-binding 
sites may regulate the activity of a GPCR. We propose that cholesterol 
functions as an endogenous SMO ligand that occupies the CRD groove 
and stabilizes a resting or apo conformation poised to respond to Hh 
signals. SMO signalling activity is compromised by mutations that 
prevent cholesterol binding or by antagonists such as vismodegib 
that act allosterically to occlude the cholesterol-binding site. SAXS 
data suggest that CRD ligands such as 20(S)-OHC, which displace 
cholesterol, produce an additional conformational change that leads 
to SMO activation. Identification of the mechanism by which PTCH1 
inhibits SMO will be necessary to understand how cholesterol-bound 
SMO is activated in response to endogenous Hh morphogens. We 
predict that SMO activation will involve alterations in the interactions 
between the CRD, ECL3, linker domain, and TMD that allow the TMD 
to adopt an active signaling state. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Reagents. NIH 3T3 and 293T cells were certified stocks obtained directly from 
ATCC. Smo~‘~ fibroblasts (which were used to express all the SMO mutants) 
have been described previously*! and were originally obtained from J. Chen and 
P. Beachy. The authenticity of Smo~‘~ cells was established by immunoblotting 
to ensure lack of endogenous SMO protein expression. Incoming cell lines were 
confirmed to be negative for mycoplasma contamination. SAG was obtained from 
Enzo Life Sciences; cyclopamine was obtained from Toronto Research Chemicals; 
SANT-1 was obtained from EMD Millipore; 20(S)-OHC was obtained from 
Steraloids. The synthesis of 20(S)-yne, 20(R)-OHC’ and 20(S)-amine-coupled 
beads for SMO binding assays has been described in detail previously”. Rabbit 
polyclonal antibodies against SMO, PTCH1 and SUFU have been described 
previously***?, and the mouse monoclonal antibody against GLI1 was obtained 
from Cell Signaling Technologies (L42B10). SHH-containing conditioned medium 
was made from 293T cells transfected with the N-terminal signalling domain of 
SHH and used at saturating concentrations (dilution of 1:4)”. 

Constructs. For large-scale expression and crystallization, a SMO construct 
(SMOAC) was designed by truncating the N and C termini of human SMO to 
leave the extracellular and transmembrane domains (UniProt Id. Q99835; residues 
32-555), and replacing intracellular loop 3 (Q99835; residues 429-440) with BRIL 
(UniProt Id. POABE7; residues 23-128). The synthetic gene encoding SMOAC was 
obtained from Geneart (Grand Island, NY) and cloned into the pHLSec vector** 
in frame to either a C-terminal Rho1D4 antibody epitope tag*>** or a mono Venus 
tag*”S followed by a Rhol1D4 antibody epitope tag. 

All mouse SMO mutants were made using Quikchange methods using a 
previously described construct*? encoding mouse SMO (pCS2+:YFP-mSmo), 
after removal of the N-terminal YFP tag by Xhol digestion. The mouse SMO- 
ACRD construct lacks residues 68-184. For stable-line construction, the mouse SMO 
coding sequence was transferred from pCS2+ to pMSCVpuro using Gibson cloning. 
Expression and purification of SMO. SMOAC was expressed by transient 
transfection in HEK-293S-GnTI~ (ATCC CRL-3022) cells in a typical batch 
volume of 9.6 |. Cells were grown in suspension at 37°C, 8.0% COs, 130 r.p.m. to 
densities of 2-3 x 10° cells mI” in protein expression medium (PEM, Invitrogen) 
supplemented with L-glutamine, non-essential amino-acids (NEAA, Gibco) and 
1% fetal calf serum (FCS, Sigma-Aldrich). Cells from 0.8 | cultures were collected 
by centrifugation (1,100 r.p.m., 7 min) and re-suspended in 120 ml Freestyle293 
medium (Invitrogen) containing 1.2 mg PEI Max (Polysciences), 0.4mg plasmid 
DNA and 5 mM valproic acid (Sigma-Aldrich), followed by a 3-6-h incubation 
at 160 r.p.m. Culture media were subsequently topped up to 0.8 | with PEM and 
returned to 130 r.p.m. 48-72 h after transfection, cell pellets were collected by 
centrifugation, snap-frozen in liquid N. and stored at -80°C, resulting in a total 
of ~150¢ of cell mass per 9.6 | suspension medium”. 

Frozen cell pellets were thawed, re-suspended in 10 mM HEPES pH 7.5, 
300 mM NaCl buffer supplemented with a 1:100 (v:v) dilution of mammalian 
protease inhibitor cocktail (P8340, Sigma-Aldrich) and solubilised with 1.3% (w/v) 
n-dodecyl-8-p-maltopyranoside (DDM, Anatrace) and 0.26% (w/v) cholesteryl 
hemisuccinate (CHS, Anatrace), then rotated for 1.5h at 4°C. Insoluble material 
was removed by centrifugation (10,000 r.p.m., 12 min, 4°C) and the supernatant 
incubated for 2h at 4°C with purified Rho-1D4 antibody (University of British 
Columbia) coupled to CNBr-activated sepharose beads (GE Healthcare). Protein- 
bound beads were washed extensively with 50 mM HEPES pH 7.5, 300 mM NaCl, 
10% (v/v) glycerol, 0.1% DDM, 0.02% CHS buffer and then with 50 mM HEPES 
pH7.5, 300mM NaCl, 10% glycerol, 0.05% DDM, 0.01% CHS buffer and eluted 
overnight in 50mM HEPES pH 7.5, 300 mM NaCl, 10% glycerol, 0.03% DDM, 
0.006% CHS, 500j1M TETSQVAPA peptide (Genscript). Eluate was concentrated 
to ~500,1l using a Vivaspin Turbo 4 PES 100kDa MWCO concentrator and loaded 
onto a Superose 6 10/300 column (GE Healthcare) equilibrated with 10 mM HEPES 
pH 7.5, 150mM NaCl, 10% glycerol, 0.03% DDM, 0.006% CHS. Peak fractions 
were pooled and concentrated to ~30 mg mI! using a Vivaspin 500 PES 100kDa 
MWCO concentrator. Samples were deglycosylated with Endoglycosidase F1 and 
incubated at room temperature for 1h. For the vismodegib complex, vismodegib 
(GDC-0449, Selleck Chem) dissolved at high concentration in DMSO was added 
to the protein sample to a final concentration of 10mM. 

For small-scale screening, cells in adherent format were transiently transfected 
using Lipofectamine2000 (Invitrogen) and the expressed protein, tagged with 
YFP?”38, was prepared in the same manner as above with quantities adjusted 
appropriately. For analysis, samples were loaded onto a Superose 6 3.2/300 
column (GE Healthcare) attached to a high-performance liquid chromatography 
system with automated micro-volume loader and in-line fluorescence detection 
(Shimadzu)*". 
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For thermostability experiments, SMOAC was expressed and purified as 

described above. After SEC purification, the pooled peak fractions were re-applied 
to purified Rho-1D4 antibody coupled to CNBr-activated sepharose beads. Equal 
amounts of beads were treated with different quantities of methyl-6-cyclodextrin 
(MBCD) and incubated with gentle rocking for 1h at 15°C before extensive 
washing with 300 mM NaCl, 50mM Hepes pH 7.5, 0.03% DDM and elution with 
300 mM NaCl, 50mM Hepes, pH 7.5, 0.03% DDM, 500}.M TETSQVAPA peptide. 
The thermostability of the different samples was assessed by heating aliquots of 
their eluates to the indicated temperatures for 1h before loading them onto a 
Superose 6 3.2/300 column on a Shimadzu system and following using absorbance 
at 280 nm. Samples were kept at the baseline temperature of 20°C when not heated. 
The construct used in this assay was not fluorescently tagged in order to avoid the 
potentially confounding effects of the fluorescent tag on overall stability. 
Stable cell lines. Stable cell lines expressing untagged SMO mutants were made by 
infecting Smo~'~ fibroblasts with a retrovirus carrying these constructs cloned into 
pMSCVpuro*. Retroviral supernatants were produced after transient transfection 
of Bosc23 helper cells with the pMSCV constructs**“?. Virus-containing media 
from these transfections was directly used to infect Smo~'~ fibroblasts, and stable 
integrants were selected with puromycin (2,1.g mI’). 

We had previously’” constructed stable lines using an identical strategy with 
SMO constructs carrying an N-terminal fluorescent protein (FP) tag; however, 
we found that epitope tagging of SMO with a fluorescent protein, or transient 
overexpression of tagged or untagged SMO, could impact the assessment of its 
signalling activity. For example, we previously measured a lower level of constitutive 
signalling activity for SMO-ACRD compared to the present study, probably due to 
the presence of an N-terminal YFP tag!°. Moreover, previous reports from three 
groups (including our own) reached somewhat divergent conclusions regarding the 
role of the CRD in basal and ligand-stimulated SMO activity’? ”. These differences 
may have been related to the use of different epitope tags and expression systems. 

Hence, all Hh signalling assays used in this study were performed with untagged 

SMO and SMO mutants stably expressed in Smo~'~ fibroblasts, with assessment 
of SMO protein levels in stable cell lines by immunoblotting. 
Hedgehog signalling assays. Stable cell lines expressing either wild-type SMO or 
SMO mutants were grown to confluence in Dulbecco's Modified Eagle’s Medium 
(DMEM) containing 10% Fetal Bovine Serum (FBS, Optima Grade, Atlanta 
Biologicals) and then exposed to DMEM containing 0.5% FBS for 24h to induce 
primary cilia assembly. These ciliated cells were then treated with saturating 
concentrations of various Hh agonists and antagonists in DMEM containing 0.5% 
FBS for 12h. 

For detection of proteins by immunoblotting, cells were washed in ice-cold 
PBS and lysed (30 min, 4°C) by agitation in modified RIPA buffer (50 mM Tris- 
HCl pH 8.0, 150mM sodium chloride, 2% NP-40, 0.25% deoxycholate, 0.1% 
sodium-dodecyl sulfate (SDS), 1mM dithiothreitol, 1 mM sodium fluoride and the 
SigmaFast Protease inhibitor cocktail). After clarification (20,000g, 30 min, 4°C), 
the protein concentration of each lysate was measured using the bicinchoninic acid 
assay (BCA, Pierce/Thermo Scientific). Lysate aliquots containing equal amounts 
of total protein were fractionated on SDS-PAGE gels (either a 8% tris-glycine gel 
or a 4-12% bis-tris gel), and transferred to a nitrocellulose membrane. Quantitative 
immunoblotting with the various antibodies was performed using the Li-Cor 
Odyssey infrared imaging system. In all immunoblots, vertical dashed black lines 
represent non-contiguous lanes from the same immunoblot juxtaposed for clarity. 
Each immunoblot was repeated 2-3 times for all experiments shown. 

Glil mRNA is a commonly used metric for Hh signalling activity, because 
Glil is a direct Hh target gene. Glil and Gapdh mRNA levels were measured 
by quantitative, reverse-transcription PCR (qRT-PCR) using the Power SYBR 
Green Cells-To-CT kit from Thermo Fisher Scientific and custom primers 
for Glil (forward primer: 5‘’-CCAAGCCAACTTTATGTCAGGG-3’ and 
reverse primer: 5‘-AGCCCGCTTCTTTGTTAATTTGA-3’) and Gapdh 
(forward primer: 5’-AGTGGCAAAGTGGAGATT-3’ and reverse primer: 
5/-GITGGAGTCATACTGGAACA-3’). Transcript levels relative to Gapdh were 
calculated using the AC; method and reported in arbitrary units. Each (RT-PCR 
experiment, which was repeated 2-4 times, included two biological replicates, each 
with two technical replicates. 

Statistical analysis of Glil mRNA levels across samples was performed using 
an ordinary one-way ANOVA test with a Holm-Sidak post-test to correct for 
multiple comparisons using the GraphPad Prism suite. Statistical significance in 
the figures is denoted as follows: NS, P > 0.05; *P <0.05, **P<0.01, ***P<0.001, 
DP < 0.0001. 

Ligand affinity chromatography. Ligand affinity chromatography to assess the 
interaction between SMO protein in detergent extracts and beads covalently 
coupled to 20(S)-amine has been described previously!’. Membranes from 
cells transiently or stably expressing constructs encoding mouse SMO variants 
were lysed in a DDM extraction buffer (50mM Tris pH 7.4, 150mM NaCl, 
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10% v/v glycerol, 0.5% w/v DDM and the SigmaFast EDTA-free protease inhibitor 
cocktail) for 2h at 4°C, followed by removal of insoluble material by ultracentrif- 
ugation (100,000g, 30 min). This DDM extract was incubated with 20(S)-OHC 
beads overnight at 4°C to allow binding to equilibrium. After extensive washing, 
proteins captured on the beads were eluted with reducing LDS sample buffer (Life 
Technologies) and 100 mM dithiothreitol. The presence of SMO in these eluates 
was determined by quantitative immunoblotting with an anti-SMO antibody 
and infrared imaging (Li-Cor Odyssey). Each experiment was repeated twice with 
similar results. 

Preparation of cholesterol-PEG3-sepharose using the azide-alkyne Huisgen 
cycloaddition reaction from the Click Chemistry toolbox. 100 1] packed 
PEG3-azide sepharose resin (22 |1mole per ml, Click Chemistry Tools) was 
washed 3 x 1 ml with 20% ethanol (v/v aq.) and re-suspended in 500 jl 20% 
ethanol (v/v). The bead slurry was supplemented with 1 mM CuSO,, 5mM Tris 
(3-hydroxypropyltriazolylmethyl)amine (THPTA), and 15 mM sodium ascorbate. 
The cycloaddition coupling reaction was initiated by adding 0.044 |1moles (high) 
of LKM-26, a previously described“ alkynyl cholesterol derivative synthesized 
in-house, to achieve an approximate ligand density of 1:50 (moles coupled azide 
functional groups: moles uncoupled azide functional groups). High, medium and 
low ligand densities (Fig. 2f) represent coupling ratios of 1:50, 1:200 and 1:1,000. 
The reaction was protected from light and allowed to proceed at room temperature 
for 20h with end-over-end rotation. The supernatant was removed from the 
resin, and the reaction quenched with 1mM EDTA in 20% ethanol (v/v aq.). The 
supernatant was extracted with diethyl ether and loaded on a normal-phase TLC 
plate next to an alkynyl cholesterol standard to assess the efficiency of coupling. 
The thin layer chromatography (TLC) plate was developed using 2% methanol in 
chloroform (v/v) and stained using 10% CuSO,/10% H3PO, followed by charring 
at 200°C for visualization. 

SMO pull-down assays using cholesterol-PEG3-sepharose. Binding reactions 
were carried out in buffer containing 20 mM HEPES pH 7.4, 150mM NaCl, and 
0.03% DDM in a final volume of 100,11. Each reaction contained 1 1g purified 
SMOAC, the same protein used for crystallization studies. Competitors were 
added to the binding reaction and incubated for 1h at room temperature before 
the addition of 2011 packed cholesterol-PEG3-sepharose. Reactions were 
incubated for 16h at 4°C for affinity capture. The binding reactions were washed 
3 x 1 ml with binding buffer and eluted using 2x Laemmli buffer for 30 min at 
room temperature. SMOAC levels in the input, flow-through and captured on the 
beads were determined by quantitative immunoblotting (Li-Cor Odyssey) using 
the 1D4 primary antibody (mouse 1:2,000). Each experiment was repeated twice 
with similar results. 

Crystallization and data collection. Protein samples were reconstituted into 
lipidic cubic phase (LCP) by mixing with molten lipid in a mechanical syringe 
mixer**. Molten lipid, consisting of 10% cholesterol (Sigma-Aldrich) and 90% 
9.9 monoacylglycerol (monoolein, Sigma-Aldrich), was mixed with detergent- 
solubilized protein (either apo or with 10 mM vismodegib in DMSO) at ~30mg ml! 
in a ratio of 60:40. A Gryphon robot (Art Robbins Instruments) was used to dis- 
pense 50 nl boluses of protein laden mesophase followed by 0.8 11 of precipitant 
solution onto each of 96 positions on a siliconized glass plate, which were then 
covered with a coverslip in a ‘sandwich-plate’ format. Crystals were grown at 
20°C and monitored by eye, using a microscope fitted with cross-polarizers, and 
subsequently imaged using a UV-imaging system (Rigaku Minstrel). apo-SMOAC 
crystallized in 0.1 M MES pH 6, 30% (v/v) PEG500 DME, 0.1 M sodium acetate, 
0.5mM zinc chloride, 0.1 M ammonium fluoride. vismo-SMOAC crystallized in 
0.09 M sodium acetate pH4, 0.09 M sodium malonate, 27% (v/v) PEG500 DME, 
0.1 M sodium acetate, 0.5 mM zinc chloride, 0.1 M ammonium fluoride. 

X-ray data collection was conducted at MX beamline 124 at the Diamond 
Light Source (Harwell, UK). Prior to data collection crystals were flash-frozen at 
105K. X-ray data were processed using Xia2 (refs 47, 48), scaled using XSCALE“ 
and merged using Aimless*®”°. The final data set used for structure solution and 
refinement was merged from data from nine crystals for apo-SMOAC and two 
crystals for vismo-SMOAC. Data collection statistics are shown in Extended Data 
Table 1. 

Structure determination, refinement and analysis. The apo-SMOAC structure 
was solved by molecular replacement in PHASER”! using the structure of human 
SMO TMD (PDB 4QIM"*), zebrafish SMO CRD (PDB 4C79"°) and BRIL (PDB 
4ETY°*) as search models. Extra electron density accounting for the region between 
CRD and LD, BRIL and TMD was immediately discernible after density modi- 
fication in PARROT™ (Extended Data Fig. 3e). We also observed extra density 
within the CRD ligand binding pocket (Fig. 2a) and assigned this to cholesterol 
bound in a stereo-specific orientation based on shape, coordination and refine- 
ment statistics (cholesterol addition improved the R-factors by over 1%), which 
is also in agreement with the markedly lower B-factor of the refined cholesterol 
compared to the protein backbone. Cholesterol may be derived from the cells (mM 


concentrations within the cell membrane) or from the LCP crystallization mix (that 
contained 10% (w/v) cholesterol). The apo-SMOAC polypeptide chain was traced 
using iterative rounds of BUCCANEER™, manual building in COOT® and refine- 
ment in autoBUSTER®™ and PHENIX™. This resulted in a well-defined model for 
the apo-SMOAC structure that included two molecules of SMO (residues 59-549) 
with a BRIL protein segment inserted between SMO TMD helices 5 and 6, two 
N-linked glycans and a cholesterol molecule. We observed a systematic disorder 
along the c axis (Extended Data Fig. 2a), resulting in alternating ordered and less 
ordered hydrophilic layers within the LCP-grown crystals. This was not caused by 
crystal non-isomorphy or pseudo-symmetry, because reducing the space group 
from C2 to P1 had no effect on the disorder. The visrmo-SMOAC structure was 
solved by molecular replacement using the apo-SMOAC structure. Extra electron 
density accounting for vismodegib and two well-ordered monoolein molecules 
was immediately apparent. The structure was refined using autoBUSTER™ 
and PHENIX” with non crystallographic and secondary structure restraints. 
Crystallographic and Ramachandran statistics are given in Extended Data Table 1. 
Stereochemical properties were assessed by MOLPROBITY™. Superpositions were 
calculated using the program COOT™, electrostatic potentials were generated 
using APBS*? and hydrophobicity was calculated according to the Eisenberg 
hydrophobicity scale®’, as implemented in PYMOL"'. Buried surface areas of 
protein-protein interfaces were calculated using the PISA webserver® with a 
probe radius of 1.4A. Sequence alignment was performed using MULTALIN® 
and formatted with ESPRIPT®. Program Caver was used with default settings to 
visualize the SMO TMD ligand binding pocket. 

Amphipol exchange and MALS. In order to avoid background light- 
scattering due to free detergent in solution, protein samples were exchanged 
into amphipol®(A8-35, Anatrace) at a mass ratio of 3:1 amphipol:protein and 
rotated at room temperature for 30 min. BioBeads (BIORAD), equilibrated in 
detergent-free buffer (10 mM HEPES pH 7.5, 300mM NaCl), were added to the 
protein-detergent-amphipol mixture at 10 mg per 1001] and incubated overnight 
at 4°C to remove all detergent molecules. For multi-angle light scattering (MALS) 
experiments, amphipol-solubilized protein at 1 mg ml"! was loaded onto a 
Superose 6 10/300 column (GE Healthcare), equilibrated in detergent-free buffer, 
on a Shimadzu system with inline MALS detector (Wyatt). Data were analysed 
using ASTRA6.1.2 software (Wyatt). For SMOAC, the values used for dn/dc and 
€ at 280 nm were 0.185 mlg™! and 1.541 ml (mg.cm)"], respectively. For protein 
conjugate analysis, the dn/dc used for amphipol A8-35 was 0.15 ml g"! (ref. 67). 
Molecular dynamics system setup. Simulations were performed using the 
GROMACS v4.6.3 simulation package®. Side chain ionization states were 
modelled using pdb2gmx (Histidine) and PropK, (all other residues)®””°. The 
N and C termini were treated with neutral charge. Intracellular loop 3 (occupied 
by the BRIL fusion in our crystal structure) was modelled using coordinates from 
the PDB entry 4N4W (ref. 16). The protein structure was then energy-minimized 
using the steepest descents algorithm implemented in GROMACS, before being 
converted to a coarse-grained representation using the MARTINI 2.2 force field”). 
The energy-minimized coarse-grained structure was centred in a simulation box 
with dimensions 100 x 100 x 180 A>. 270 1-palmitoyl-2-oleoyl-sn-glycero-3- 
phosphocholine (POPC) lipids were randomly placed around the protein and the 
system solvated and neutralised to a concentration of 0.15 M NaCl. An initial 1 1s 
of coarse-grained simulation was applied to permit the self-assembly” of a POPC 
lipid bilayer around the GPCR. During the coarse-grained simulation, the structure 
of the protein was maintained by an elastic network, allowing local conformational 
flexibility of the protein. Thus, the protein was able to adopt its optimal orientation 
within the lipid bilayer”*. The endpoint of the coarse-grained bilayer self-assembly 
simulation was converted back to atomic detail using a fragment-based protocol 
for the lipid conformations”, while retaining the original crystal structure of the 
protein, now located in its optimal orientation and position within the lipid bilayer. 
Equilibration of the atomic system was achieved through 1 ns of NPT simulation 
with the protein coordinates restrained, before the system was subjected to 100 ns 
of unrestrained atomistic molecular dynamics. Simulations were performed both 
in the presence and absence of the cholesterol ligand. Five repeat simulations were 
run for each case. 

Coarse-grained simulations. The standard MARTINI force field” and its 
extension to proteins’!”° was used to describe all system components. During 
the coarse-grained self-assembly simulation an ELNEDYN network” was 
applied to the protein using force constant of 500kJ mol"! nm” and a cutoff of 
1.5nm. Temperature was maintained at 310 K using a Berendsen thermostat’® 
with a coupling constant of 7;= 1 ps, and pressure was controlled at 1 bar using a 
Berendsen barostat”* with a coupling constant of 7, = 1 ps and a compressibility of 
5 x 10 ° bar’. Electrostatics and van der Waals interactions in the CG simulations 
were shifted between 0 and 1.2nm, and 0.9 and 1.2 nm, respectively, using the 
standard MARTINI protocol’®. An integration time step of 20 fs was applied. 
Covalent bonds were constrained to their equilibrium values using the LINCS 
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algorithm”. All simulations were run in the presence of standard MARTINI water 
particles”, and ions added to an approximate concentration of 0.15 M NaCl. 
Atomistic simulations. Atomistic simulations were run using the GROMOS53a6 
force field®°, and its extension to glycans*!. The system was solvated using the 
SPC water model, and ions added to yield an electrically neutral system with a 
NaCl concentration of approximately 0.15 M. Systems contained approximately 
140,000 atoms including 270 POPC molecules, ~41,000 water molecules, 149 
sodium ions, and 154 chloride ions. Periodic boundary conditions were applied, 
with a simulation time step of 2 fs. Temperature was maintained at 310 K using 
a V-rescale thermostat® with a coupling constant of 0.1 ps, while pressure was 
controlled at 1 bar through coupling to a Parrinello-Rahman barostat® with a 
coupling constant of 1 ps. Particle Mesh Ewald (PME)** was applied to model long- 
range electrostatics. The LINCS algorithm was used to constrain covalent bond 
lengths”. The g_dist, g rmsf and g_rms tools implemented in the GROMACS 
v4.6.3 software package®* were applied to analyse the simulations, with VMD*° 
and PyMOL" used for visualization. Ca r.m.s.d. calculations for the CRD were 
performed after a least-squares fit of the trajectory to Ca particles of the initial 
(crystal structure) CRD coordinates. Co r.m.s.d. calculations for the TMD were 
performed after a least squares fit to Ca particles of the initial (crystal structure) 
transmembrane helix coordinates, excluding the inter-helix loop regions of the 
TMD from this calculation. DSSP matrices were produced using the do_dssp tool 
implemented in GROMACS®. 
SAXS experiments. For size-exclusion chromatography-coupled small-angle 
X-ray scattering (SEC-SAXS), amphipol-exchanged untreated and 20(S)-OHC- 
treated (~5mM 20(S)-OHC) SMOAC were loaded onto a 4.8-ml KW-403 
column (Shodex), equilibrated in a no-detergent buffer, on an Agilent 1260 system 
(B21, Diamond Light Source). Approximately 45 11 of sample was injected at 9.6 
(20(S)-OHC-treated) or 13 (apo) mg ml"! using a flow rate of 160 jl per min. 
Chromatographic elution was directed into a specialized SAXS flow cell, with 
a 1.6mm path length, held at 20°C. SAXS measurements were made using a 
sample-to-detector distance of 4.09 m at a wavelength of 1 A. SAXS images 
(frames) were collected as a continuous set of 3 s exposures across the elution 
peak. The corresponding buffer background frames for producing the background- 
subtracted SAXS curve was collected at greater than 1.5 column volumes. 
Images were corrected for variations in beam current, normalized for exposure 
time and processed into 1D scattering curves using in-house beamline software 
(GDA). Buffer subtractions and all other subsequent analysis were performed 
with the program ScAtter (http://www.bioisis.net/scatter). Samples were checked 
for radiation damage by visual inspection of the Guinier region as a function of 
exposure time. Considerable differences between the SAXS curves (q > 0.05 A“!) 
of the treated and untreated samples imply major structural differences between the 
two states. Large differences between the two P(r) distributions imply a significant 
structural change in the 20(S)-OHC state. 


31. Varjosalo, M., Li, S. P. & Taipale, J. Divergence of hedgehog signal transduction 
mechanism between Drosophila and mammals. Dev. Cel/ 10, 177-186 
(2006). 

32. Rohatgi, R., Milenkovic, L. & Scott, M. P. Patched1 regulates hedgehog signaling 
at the primary cilium. Science 317, 372-376 (2007). 

33. Humke, E. W., Dorn, K. V., Milenkovic, L., Scott, M. P. & Rohatgi, R. The output 
of Hedgehog signaling is controlled by the dynamic association between 
Suppressor of Fused and the Gli proteins. Genes Dev. 24, 670-682 (2010). 
10.1101/gad.1902910 

34. Aricescu, A. R., Lu, W. & Jones, E. Y. A time- and cost-efficient system for 
high-level protein production in mammalian cells. Acta Crystallogr. D Biol. 
Crystallogr. 62, 1243-1250 (2006). 

35. Molday, R. S. & MacKenzie, D. Monoclonal antibodies to rhodopsin: 
characterization, cross-reactivity, and application as structural probes. 
Biochemistry 22, 653-660 (1983). 

36. Oprian, D. D., Molday, R. S., Kaufman, R. J. & Khorana, H. G. Expression of a 
synthetic bovine rhodopsin gene in monkey kidney cells. Proc. Nat! Acad. Sci. 
USA 84, 8874-8878 (1987). 

37. Zacharias, D.A., Violin, J. D., Newton, A. C. & Tsien, R. Y. Partitioning of 
lipid-modified monomeric GFPs into membrane microdomains of live cells. 
Science 296, 913-916 (2002). 

38. Nagai, T. et al. A variant of yellow fluorescent protein with fast and efficient 
maturation for cell-biological applications. Nat. Biotechnol. 20, 87-90 (2002). 

39. Rohatgi, R., Milenkovic, L. Corcoran, R. B. & Scott, M. P. Hedgehog signal 
transduction by Smoothened: pharmacologic evidence for a 2-step activation 
process. Proc. Nat! Acad. Sci. USA 106, 3196-3201 (2009). 

40. Miller, P. S. & Aricescu, A. R. Crystal structure of a human GABAy receptor. 
Nature 512, 270-275 (2014). 

41. Kawate, T. & Gouaux, E. Fluorescence-detection size-exclusion chromatography 
for precrystallization screening of integral membrane proteins. Structure 14, 
673-681 (2006). 

42. Bailey, E. C., Milenkovic, L., Scott, M. P, Collawn, J. F. & Johnson, R. L. Several 
PATCHED1 missense mutations display activity in patched1-deficient 
fibroblasts. J. Biol. Chem. 277, 33632-33640 (2002). 


ARTICLE 


. Pear, W. S., Nolan, G. P,, Scott, M. L. & Baltimore, D. Production of high-titer 


helper-free retroviruses by transient transfection. Proc. Nat! Acad. Sci. USA 90, 
8392-8396 (1993). 


. Ciepla, P. et al. New chemical probes targeting cholesterylation of Sonic 


Hedgehog in human cells and zebrafish. Chem. Sci. (Camb.) 5, 4249-4259 
(2014). 


. Chung, S.-K., Shim, J.-Y., Kang, M. G., Lee, K. W. & Kang, H. |. Studies of steroids 


as potential antifungal agent 2. Side chain modified cholesterols. Korean 
J. Med. Chem. 8, 14-17 (1998). 


. Caffrey, M. & Cherezov, V. Crystallizing membrane proteins using lipidic 


mesophases. Nat. Protocols 4, 706-731 (2009). 


. Winter, G. xia2: an expert system for macromolecular crystallography data 


reduction. J. Appl. Crystallogr. 43, 186-190 (2010). 


. Kabsch, W. Xds. Acta Crystallogr. D 66, 125-132 (2010). 
. Evans, P. R. & Murshudov, G. N. How good are my data and what is the 


resolution? Acta Crystallogr. D 69, 1204-1214 (2013). 


. Winn, M. D. et a/. Overview of the CCP4 suite and current developments. 


Acta Crystallogr. D 67, 235-242 (2011). 


. McCoy, A. J. et a/. Phaser crystallographic software. J. Appl. Crystallogr. 40, 


658-674 (2007). 


. Liu, W. et al. Structural basis for allosteric regulation of GPCRs by sodium ions. 


Science 337, 232-236 (2012). 


. Cowtan, K. Recent developments in classical density modification. Acta 


Crystallogr. D Biol. Crystallogr. 66, 470-478 (2010). 


. Cowtan, K. The Buccaneer software for automated model building. 1. Tracing 


protein chains. Acta Crystallogr. D 62, 1002-1011 (2006). 


. Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. 


Acta Crystallogr. D 60, 2126-2132 (2004). 


. BUSTER v. 2.10.2 (Global Phasing Ltd., Cambridge, United Kingdom, 


2011). 


. Adams, P. D. et al. PHENIX: a comprehensive Python-based system for 


macromolecular structure solution. Acta Crystallogr. D 66, 213-221 
(2010). 


. Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular 


crystallography. Acta Crystallogr. D 66, 12-21 (2010). 


. Baker, N.A., Sept, D., Joseph, S., Holst, M. J. & McCammon, J. A. Electrostatics 


of nanosystems: application to microtubules and the ribosome. Proc. Natl 
Acad. Sci, USA 98, 10037-10041 (2001). 


. Eisenberg, D., Schwarz, E., Komaromy, M. & Wall, R. Analysis of membrane 


and surface protein sequences with the hydrophobic moment plot. J. Mol. Biol. 
179, 125-142 (1984). 


. Schrodinger, L.L.C. The PYMOL Molecular Graphics System, Version 1.3r1 


(2010). 


. Krissinel, E. & Henrick, K. Inference of macromolecular assemblies from 


crystalline state. J. Mol. Biol. 372, 774-797 (2007). 


. Corpet, F. Multiple sequence alignment with hierarchical clustering. Nucleic 


Acids Res. 16, 10881-10890 (1988). 


. Robert, X. & Gouet, P. Deciphering key features in protein structures with the 


new ENDscript server. Nucleic Acids Res. 42, W320-W324 (2014). 


. Chovancova, E. et al. CAVER 3.0: a tool for the analysis of transport pathways in 


dynamic protein structures. PLOS Comput. Biol. 8, e1002708 (2012). 


. Tribet, C., Audebert, R. & Popot, J. L. Amphipols: polymers that keep 


membrane proteins soluble in aqueous solutions. Proc. Natl Acad. Sci. USA 93, 
15047-15050 (1996). 


. Gohon, Y. et al. Bacteriorhodopsin/amphipol complexes: structural and 


functional properties. Biophys. J. 94, 3523-3537 (2008). 


. Hess, B., Kutzner, C., van der Spoel, D. & Lindahl, E. GROMACS 4: Algorithms 


for highly efficient, load-balanced, and scalable molecular simulation. J. Chem. 
Theory Comput. 4, 435-447 (2008). 


. Olsson, M. H. M., Sandergaard, C. R., Rostkowski, M. & Jensen, J. H. PROPKA3: 


consistent treatment of internal and surface residues in empirical pKa 
predictions. J. Chem. Theory Comput. 7, 525-537 (2011). 


. Sendergaard, C. R., Olsson, M. H. M., Rostkowski, M. & Jensen, J. H. Improved 


treatment of ligands and coupling effects in empirical calculation and 
rationalization of pKa values. J. Chem. Theory Comput. 7, 2284-2295 
(2011). 


. de Jong, D. H. et al. Improved parameters for the martini coarse-grained 


protein force field. J. Chem. Theory Comput. 9, 687-697 (2013). 


. Scott, K. A. et al. Coarse-grained MD simulations of membrane protein-bilayer 


self-assembly. Structure 16, 621-630 (2008). 


. Stansfeld, P. J. et al. MemProtMD: automated insertion of membrane protein 


structures into explicit lipid membranes. Structure 23, 1350-1361 
(2015). 


. Stansfeld, P. J. & Sansom, M. S. Molecular simulation approaches to 


membrane proteins. Structure 19, 1562-1572 (2011). 


75. Marrink, S. J., Risselada, H. J., Yefimov, S., Tieleman, D. P. & de Vries, A. H. 


77. 


The MARTINI force field: coarse grained model for biomolecular simulations. 
J. Phys. Chem. B 111, 7812-7824 (2007). 


. Monticelli, L., Sorin, E. J., Tieleman, D. P,, Pande, V. S. & Colombo, G. Molecular 


simulation of multistate peptide dynamics: a comparison between 
microsecond timescale sampling and multiple shorter trajectories. J. Comput. 
Chem. 29, 1740-1752 (2008). 

Periole, X., Cavalli, M., Marrink, S. J. & Ceruso, M. A. Combining an elastic 
network with a coarse-grained molecular force field: structure, dynamics, 
and intermolecular recognition. J. Chem. Theory Comput. 5, 2531-2543 
(2009). 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


78. Berendsen, H. J. C., Postma, J. P. M., Vangunsteren, W. F., Dinola, A. & Haak, J. R. 
79. 


80. 


81. 


Molecular-dynamics with coupling to an external bath. J. Chem. Phys. 81, 
3684-3690 (1984). 

Hess, B., Bekker, H., Berendsen, H. J. C. & Fraaije, J. G. E. M. LINCS: A linear 
constraint solver for molecular simulations. J. Comput. Chem. 18, 1463-1472 
(1997). 

Oostenbrink, C., Villa, A., Mark, A. E. & van Gunsteren, W. F. A biomolecular 
force field based on the free enthalpy of hydration and solvation: the GROMOS 
force-field parameter sets 53A5 and 53A6. J. Comput. Chem. 25, 1656-1676 
(2004). 

Pol-Fachin, L., Verli, H. & Lins, R. D. Extension and validation of the GROMOS 
53A6(GLYC) parameter set for glycoproteins. J. Comput. Chem. 35, 2087-2095 
(2014). 


82. 
83. 


84. 
85. 
86. 
87. 


Bussi, G., Donadio, D. & Parrinello, M. Canonical sampling through velocity 
rescaling. J. Chem. Phys. 126, 014101 (2007). 

Parrinello, M. & Rahman, A. Polymorphic transitions in single-crystals — 
anew molecular-dynamics method. J. Appl. Phys. 52, 7182-7190 

(1981). 

Essmann, U. et al. A smooth particle mesh Ewald method. J. Chem. Phys. 103, 
8577-8593 (1995). 

Humphrey, W., Dalke, A. & Schulten, K. VMD: visual molecular dynamics. J. Mol. 
Graph. 14, 33-38 (1996). 

Karplus, P. A. & Diederichs, K. Linking crystallographic model and data quality. 
Science 336, 1030-1033 (2012). 

Zoonens, M. & Popot, J. L. Amphipols for each season. J. Membr. Biol. 247, 
759-796 (2014). 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


Extended Data Figure 1 | Sequence alignment of SMO orthologues. 
Numbering corresponds to that of human SMO. Secondary structure 
assignments are displayed above the alignment and colour-coded as in 
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Fig. 1. Black arrows and numbers (£X.50) below alignment show class highlighted in purple. 
F Ballesteros-Weinstein nomenclature for GPCR helices!®. Residues 
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interacting with cholesterol are highlighted in red. Disulfide bridges are 
highlighted in yellow and numbered. N-linked glycosylation sites are 
depicted by a hexagon. The position of the Val329Phe point mutation is 
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Extended Data Figure 2 | Characterization of the SMO Val329Phe 
mutation. a, Superposition of SMOAC structure (blue) with the SMO- 
SANT-1 complex structure, which lacks the CRD (green, PDB 4N4W 
(ref. 16)), showing the TMD ligand-binding pocket as a yellow surface. 
Inset shows Val329, mutated to Phe in our structure. b, SEC analysis of 
fluorescently labelled SMOAC showing difference in expression levels of 
wild-type and Val329Phe variant (main protein peak ~20 min). c, 20(S)- 
OHC beads can bind both mouse wild-type SMO and Val333Phe (mouse 
Val333 corresponds to human Val329). Immunoblots, using an anti-SMO cells. f, Immunoblot shows SMO and GLI protein levels in these stable 


antibody directed against the ICD, were used to measure SMO captured on cell lines, with p38 as loading control. Each experiment was replicated >2 
20(S)-OHC beads. Adding 501M free 20(S)-OHC as a competitor reduced __ times with similar results. 


binding. d, Purified human SMOAC (the crystallization construct) 

binds to 20(S)-OHC beads. e, Smo~'~ mouse fibroblasts stably expressing 
SMO-WT or SMO-Val333Phe were exposed to SHH, SAG or 20(S)-OHC. 
Levels of endogenous Glil mRNA (mean arbitrary units + s.d.,n =4), 
measured by qRT-PCR, were used as a metric of Hh pathway activity 
because Gli] is a direct Hh target. Asterisks indicate statistical significance 
(#*** P < 0.0001) based on one-way ANOVA for the difference in Glil 
mRNA levels between identically treated SMO-WT and SMO-Val333Phe 
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Extended Data Figure 3 | Crystallization, structure solution and 
oligomeric state of SMOAC. a, SMOAC crystal packing. Asymmetric 
unit consists of two antiparallel SMOAC chains. Chain A coloured as in 
Fig. 1 with BRIL fusion in yellow; Chain B in grey. LCP crystal packing 
with alternating hydrophobic and hydrophilic layers perpendicular to the 
c axis. Molecules coloured as for Chain A. b, Pearson correlation 
coefficient (CC) analysis®® used to relate data quality with model quality. 
A CCwork and CCfree smaller than CC* indicates that the model does not 
account for all of the signal in the data (and is therefore not overfit). 

c-e, SigmaA-weighted 2F,—F, electron density maps of final refinement 
at 1.0o contour level. c, Val329Phe mutation. d, Extra density within TMD 
ligand-binding pocket (F,-F, maps shown at contour level of +30 (green) 
and -3a (red)) (This density could not be confidently assigned, probably 
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because of low occupancy within the crystal.). e, “Connector region 
linking the CRD and linker domain, with Asn188 and linked 

N-acetyl glycosamine moiety. f, SEC-MALS analysis of amphipol- 
solubilized SMOAC. Molar masses (Mw, black lines) and 280 nm 
absorption (grey line) plotted against elution time. Mw derived from 
protein-conjugate analysis indicated in parentheses. For clarity, graphs 
of My are shown only around main absorption peak. Theoretical Mw 
of SMOAC based on sequence is 71 kDa with the extra mass observed 
in MALS (~13 kDa) probably due to three N-linked glycosylation sites. 
Expected mass of protein-bound amphipol (A8-35) was previously 
determined to be 40-75 kDa (ref. 87), in agreement with our data. This 
analysis suggests that SMOAC is a monomer under our purification 
conditions. 
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Extended Data Figure 4 | Interfaces of the SMO CRD. a, b, Detailed 
interactions of the CRD with the connector region (a) and with the linker 
domain-TMD segment (b). The number of interactions are indicated in 
the top panel and coloured as indicated in the key box. For non-bonded 
contacts, the width of the striped line is proportional to the number of 
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atomic contacts. Residue colouring is according to amino acid: blue, 
positive (H,K,R); red, negative (D,E); green, neutral (S,T,N,Q); grey, 
aliphatic (A,V,L,I,M); purple, aromatic (E,Y,W); orange, proline (P) or 
glycine (G); yellow, cysteine (C). The figure is adopted from the PDBSUM 
server (http://www.ebi.ac.uk/pdbsum/). 
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Extended Data Figure 5 | Comparison of SMOAC with previously 0.483 A for 240 equivalent Ca positions), LY2940680 (pale green, PDB 
determined SMO TMD structures. Superposition of the SMOAC 4JKV, r.m.s.d. 0.493 A for 230 equivalent Ca positions), SAGI.5 (pale 
structure with SMO TMD structures. Structural alignment was performed yellow, PDB 4QIN, r.m.s.d. 0.623 A for 262 equivalent Ca positions). 
using the 7TM bundle as template (not including the linker domain or The box shows a close-up view of the linker domain region revealing 
TMD helix 8). SMOAC (red), SMO TMD complexed with cyclopamine a structural rearrangement in the SMOAC structure compared to 
(light orange, PDB 409R, r.m.s.d. 0.598 A for 243 equivalent Ca the previously determined SMO TMD structures lacking the native 
positions), antaXV (light blue, PDB 4QIM, r.m.s.d. 0.515 A for extracellular domain. 


233 equivalent Ca positions), SANT1 (pale cyan, PDB 4N4W, r.m.s.d. 
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Extended Data Figure 6 | See next page for caption. 
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Extended Data Figure 6 | Cholesterol stabilizes SMO. a-e, MD 
simulations of SMO in a lipid bilayer. a, SMO embedded in a lipid bilayer 
with the CRD in orange, the seven-pass transmembrane region excluding 
intra- and extracellular loops (7TM) in blue and cholesterol in cyan. 

b-d, Relative r.m.s. fluctuations of the Ca atoms over the course 

of 5 x 100 ns of atomistic MD simulation in the presence and absence 

of cholesterol. The structures in b and c are shown as putty representations 
coloured from high conformational stability (that is, low r.m.s. 
fluctuations; blue/thin) to low stability (that is, high r.m.s. fluctuations; 


ARTICLE 


red/thick). e, Secondary structure DSSP matrices for each of the 
simulations. The asterisks in b, c and e all mark the helix spanning residues 
155-160, which is destabilized in the absence of bound cholesterol. 

f, g, Thermostability of purified SMOAC. See Supplementary Discussion 
for details. f, Compiled peak heights from thermostability SEC analysis of 
purified SMOAC after treatment with different MBCD concentrations. 

g, Example of raw SEC data used for the analysis in f. Samples were 
incubated at 35°C for 1h before loading onto the SEC column. 
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Extended Data Figure 7 | Effect of domain interface mutations on 
expression levels and 20(S)-OHC binding. a, Protein levels of SMO 
and also of PTCH1 and GLI (each of which is encoded by a direct Hh 
target gene) measured by immunoblot from Smo~'~ mouse fibroblasts 
stably expressing one of five SMO variants: wild-type SMO (WT); SMO 
lacking the entire CRD (ACRD); SMO with two mutations (Pro120Ser 
or Ile160Asn/Glu162Thr) that introduce glycosylation sites in the linker 
domain-CRD interface; and SMO lacking a conserved disulfide bond 
(Cys197Ser/Cys217Ser, marked 6 in Fig. 4c) in the linker domain. Elevated 
levels of GLI] and PTCHI reflect high constitutive signalling activity of 
each mutant. NS, a non-specific band detected by anti-PTCH1 antibody; 
SUFU, loading control. Different patterns seen in SMO panel are caused 


by different numbers of N-linked glycosylation sites. b, Glil mRNA levels 
(mean arbitrary units + s.d., 1 = 3) were used to assess Hh signalling 
activity in Smo~/~ cells stably expressing the indicated mouse SMO 
variants. One-way ANOVA was used to assess statistical significance 
(#*** P < 0.0001). D477R and M2 (Trp539Leu) are two previously 
described mutations in the TMD that increase constitutive signalling. 

c, Oxysterol-binding capacity of each SMO variant was determined (right 
blot) by its ability to bind to 20(S)-OHC beads in the absence or presence 
of 504M free 20(S)-OHC. Inputs for each binding reaction are shown 

on the left. Each experiment was repeated 2 or more times with similar 
results. 
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Extended Data Figure 8 | SAXS analysis of SMOAC. a, Overlay of size- Absorbance of the free amphipol is negligible and elutes ~5 min after the 
exclusion chromatograms monitored at 280 nm (A290) collected during amphipol-stabilized SMOAC samples. b, Dimensionless Kratky plot of 
SAXS measurements for apo-SMOAC (red), (+)20(S)-OHC SMOA apo- and (+)20(S)-OHC-loaded SMOAC SAXS data. Cross-hairs denote 
C (blue), amphipol (green) and BSA standard (black). Amphipol and the Guinier-Kratky point (V3, 1.1), the peak position for an ideal, globular 
BSA were injected at 10mg ml’. Inset shows curves normalized to peak particle. The slower decay of the transformed scattering intensities for 
height. BSA was used as a reference with a radius-of-hydration of 3.7 nm. (+)20(S)-OHC (blue) indicate a comparably less spherical particle. 
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Extended Data Figure 9 | Crystal structure of the SMOAC-vismodegib 
complex and structural analysis of mutations found in vismodegib- 
resistant cancers. a, Chemical structure of vismodegib. b, Close-up view 
of vismodegib-binding site. Colour-coding follows Fig. 5b. Composite 
omit map calculated with PHENIX at 1.00 shown as magenta chicken- 
wire. c, Mapping of residues that are mutated in vismodegib-resistant 
tumours (yellow highlights). Brackets indicate mutant residues. 

d-f, Close-up views of selected interactions. Native residues in blue and 
mutated residues in yellow. Arrows indicate position of potential clashes 
that could disrupt vismodegib binding. d, Gln477/Asp473 hot spot. The 
Gln477Glu mutation leads to a loss of the potential hydrogen bond of 
the glutamine sidechain to the chloride of the vismodegib chlorophenyl- 
methylsulfone moiety. The Asp473His mutation potentially destabilizes 
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the hydrogen-bonding network around Arg400 that coordinates the 
vismodegib chlorophenyl-methylsulfone moiety. e, The imidazole ring of 
His231 is within hydrogen-bonding distance of two carbonyl main-chain 
atoms of residues Ser385 and Val386 located on a loop coordinating the 
interaction of Asp384 with vismodegib’s amide linker. f, Trp281 forms 

a key hydrophobic interaction with the vismodegib pyrimidine ring that 
is deeply buried in the SMO helical bundle core. Mutation to cysteine 
would significantly destabilize this interaction while mutation of nearby 
Val321 to the bulkier methionine would probably result in a 
rearrangement of the Trp281 side chain. g, h, SMOAC captured on 
cholesterol beads in the presence of increasing concentrations of free 
vismodegib or 20(S)-OHC (h). Results from one of two independent 
pull-down experiments are shown. 
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Extended Data Table 1 | Crystallographic data collection and refinement statistics 


apo-SMOAC vismo-SMOAC 
Data collection 
Beamline DIAMOND-124 DIAMOND-I24 
Wavelength (A) 0.9686 0.9686 
No. crystals 9 2 
Resolution (A) 61.0-3.2 (3.41-3.20) 68.4-3.3 (3.39-3.30) 
Space group C2 P1 
Cell dimensions 
a, b, c (A) 122.9, 63.0, 208.6 52.8, 71.3, 107.0 
a,B, y (°) 90.0, 96.6, 90.0 91.5, 98.2, 105.9 
Unique reflections 25214 (3815) 21097 (2380) 
Completeness (%) 93.8 (78.8) 94.7 (85.8) 
Rmerge (%) 30.5 (>100) 15.2 (>100) 
Rpim (%) 13.8 (>100) 9.9 (>100) 
CCi2 0.980 (0.237) 0.953 (0.357) 
cc* 0.995 (0.580) 0.988 (0.460) 
CCwork 0.923 (0.354) 0.703 (0.340) 
CCtree 0.941 (0.333) 0.941 (0.315) 
Vol 4.3 (0.6) 3.7 (0.6) 
Multiplicity 5.8 (5.1) 1.8 (1.7) 
Refinement 
Resolution (A) 61.0-3.2 60.0-3.3 
No. reflections (test set) 25151 (1302) 20588 (1013) 
Rwork/Riree (%) 23.2/26.4 24.5/25.7 
No. atoms 
Protein 9207 8988 
Ligand/lon 58 119 
B factors (A?) 
Protein 137 124 
Ligand/lon 121 119 
r.m.s.d. bonds (A) 0.008 0.008 
r.m.s.d. angles (°) 0.90 0.87 
Ramachandran (%) 
Favoured 96.6 96.8 
Allowed 3.2 2.9 
Disallowed 0.2 0.3 


Numbers in parentheses refer to the highest resolution shell. Riree, R-factor against 5% of the data; r.m.s.d., root mean square deviation from ideal geometry; Rpim, precision indicating merging 
R-factor; CC1,/2, correlation coefficient between random half data sets; CC*, analytical estimate of CCtrue based on CC1/2; CCwork, standard correlation of the experimental intensities with the intensities 
calculated from the refined model; CCiree, cross-validated correlation of the experimental intensities with the intensities calculated from the refined model. Ramachandran statistics were calculated 
using MolProbity. 
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Suppression of star formation in dwarf galaxies by 
photoelectric grain heating feedback 


John C. Forbes!, Mark R. Krumholz!?, Nathan J. Goldbaum? & Avishai Dekel‘ 


Photoelectric heating—heating of dust grains by far-ultraviolet 
photons—has long been recognized as the primary source of 
heating for the neutral interstellar medium’. Simulations of spiral 
galaxies” have shown some indication that photoelectric heating 
could suppress star formation; however, simulations that include 
photoelectric heating have typically shown that it has little effect 
on the rate of star formation in either spiral galaxies** or dwarf 
galaxies°, which suggests that supernovae are responsible for setting 
the gas depletion time in galaxies®**. This result is in contrast with 
recent work® }3 indicating that a star formation law that depends 
on galaxy metallicity—as is expected with photoelectric heating, 
but not with supernovae—reproduces the present-day galaxy 
population better than does a metallicity-independent one. Here we 
report a series of simulations of dwarf galaxies, the class of galaxy 
in which the effects of both photoelectric heating and supernovae 
are expected to be strongest. We simultaneously include space- 
and time-dependent photoelectric heating in our simulations, and 
we resolve the energy-conserving phase of every supernova blast 
wave, which allows us to directly measure the relative importance 
of feedback by supernovae and photoelectric heating in suppressing 
star formation. We find that supernovae are unable to account 
for the observed" large gas depletion times in dwarf galaxies. 
Instead, photoelectric heating is the dominant means by which 
dwarf galaxies regulate their star formation rate at any given time, 
suppressing the rate by more than an order of magnitude relative 
to simulations with only supernovae. 

To investigate whether the depletion times in dwarf galaxies, 
which are longer than those for Milky-Way-like galaxies by more 
than an order of magnitude'*’®, are set by the momentum injection 
from supernovae or by photoelectric heating, we perform a series 
of high-resolution hydrodynamic simulations using the Enzo adap- 
tive mesh refinement code!®. We additionally include a prescription 
for supernova and pre-supernova stellar feedback, and a method 
for self-consistent, spatially dependent photoelectric heating (see 
Methods). We use two sets of initial conditions. Both correspond to 
isolated dwarf galaxies with an initially laminar gas disk with a gas 
surface density profile that falls off exponentially, a stationary hot halo, 
and collisionless particles representing stars and dark matter. The gal- 
axies have a dark matter halo mass of 10! solar masses (Mo), a stellar 
mass of 10’Mo and an observationally motivated’” cold gas mass of 
108Mo Galaxies in this mass range are comfortably above the limit at 
which star formation can be quenched by the cosmological ultravio- 
let background’, but small enough that the effects of heating by far- 
ultraviolet photons and supernova feedback’? are plausibly extreme. 
The initial conditions differ in the exponential scale-length chosen 
for the gas. One set uses 5 kpc, which is designed to mimic recently 
discovered, nearly starless galaxies”®, and which is towards the high 
end of the range observed for field dwarf galaxies'*; the other set uses 
1 kpc, which is towards the low end of the observed field dwarf range. 


For a galaxy with an H1 mass of 10°Mo, and assuming an exponential 
H1 profile, the observed relation”! between H1 mass and HI size sug- 
gests an H1 scale-length of about 1.9 kpc. 

To understand how supernovae and photoelectric heating each con- 
tribute to the evolution of these galaxies, we perform a straightforward 
numerical experiment. We run a fiducial simulation, starting from 
initial conditions with a 5-kpc gas scale-length, that includes super- 
novae and photoelectric heating, and simulations for which each of 
these effects is turned off in turn. We refer to these as the ‘SN + PE? 
‘PE only, ‘SN only’ and the ‘no feedback simulations. We also run the 
‘SN + PE; ‘PE only’ and ‘no feedback’ cases for initial conditions with 
a 1-kpc scale-length. The simulations with supernovae also include 
pre-supernova stellar feedback from winds and H 1 regions. For the 
5-kpc case, we re-run the simulations of the four feedback models at 
three maximum spatial resolutions: 10 pc, 5 pc and 2.5 pc. These reso- 
lutions are high enough, and the typical densities in which supernovae 
explode in these simulations are low enough, that our simulations do 
not suffer from the overcooling problem”’, whereby poorly resolved 
simulations overestimate the rate at which supernova-heated gas cools 
(see Extended Data Fig. 1). We focus first on the 10-pc-resolution sim- 
ulations with the 5-kpc initial conditions, because we ran these for the 
longest time. In Methods, we compare all our simulations to the higher 
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Figure 1 | The morphology of the gas. For each of the simulations, 

the surface density after 90 Myr of evolution, starting from initial 
conditions with a 5-kpc scale-length, is integrated between +200 pc in 
the y dimension (top panels) and in the z dimension (bottom panels). 
The morphology of the disk is essentially determined by the presence of 
supernovae, despite the fact that the ‘PE only’ and ‘PE+ SN’ runs have 
nearly identical star formation rates (Fig. 2). The star formation rate in 
the ‘SN only’ simulation is an order of magnitude higher than that in the 
‘PE+SN’ simulation, which is why the outflow and the disruption of the 
cold disk are more striking in the former. 
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Figure 2 | The star formation rates of the simulations. The long 
depletion times frequently observed in dwarfs (see Extended Data Fig. 2) 
are reproduced when photoelectric heating and supernova feedback are 
both included (‘PE + SN’ black lines). Turning off supernova feedback 

has almost no effect (compare the red and black lines), which implies that 
photoelectric heating alone is responsible for the long depletion times in 
the simulations. More compact galaxies (1-kpc initial scale-length; thin 
lines) have higher star formation rates and shorter depletion times than do 
less compact galaxies (5-kpc initial scale-length; thick lines), but, as in the 
extended case, photoelectric heating remains the dominant effect. 


resolution runs to evaluate the level of convergence of our results. 
We compare the 5-kpc and 1-kpc initial conditions below. 

We find that all of the simulations follow a similar initial transient 
behaviour. The gas disk cools from its centre outwards, causing the 
disk to collapse vertically. Stars form first in the centre, then further 
and further out. The central region of the galaxy after 90 Myr is shown 
(with the aid of the yt package”*) in Fig. 1, for each of the feedback 
models. In terms of large-scale morphology, the supernovae have the 
most noticeable effect, driving large outflows with mass loading fac- 
tors of the order of 10. Photoelectric heating slightly alters the global 
structure of the gas, but the two simulations without supernovae look 
quite similar. 

Figure 2 shows the star formation rates and depletion times as a 
function of time for each of the four feedback models. Each simulated 
galaxy experiences an initial transient period as the gas collapses from 
its initial state; however, the subsequent instantaneous star forma- 
tion rates and depletion times of the galaxies are markedly different 
depending on whether photoelectric heating was included or not. By 
contrast, two simulations that differ only in their inclusion of super- 
nova feedback result in similar depletion times. This immediately 
shows that photoelectric heating, not feedback from supernovae, 
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Figure 3 | The photoelectric heating rate. Every young star particle in the 
simulation produces far-ultraviolet radiation. The flux from all of these 
stars is summed in each cell (see Methods), yielding the volumetric heating 
rate ®/ny shown here. Given the low surface density of young stars, this 
distribution is highly inhomogeneous. The low metallicity and small size 
of the galaxy means that the mean free path of the far-ultraviolet photons 
is large, so the flux from each star and, hence, the heating rate falls off as 
r~? from each source (where r is the radial coordinate). 


is primarily responsible for the long depletion times observed in dwarf 
galaxies. Supernovae, or even a lack of photoelectric heating, can result 
in low star formation rates in the long run by rapidly ejecting gas or by 
locking gas into stellar remnants, respectively. This can be seen in the 
central kiloparsecs of the simulations without photoelectric heating: 
the star formation rate decreases in this region, but the depletion time 
is relatively unaffected. Only the simulations including photoelectric 
heating produce depletion times in reasonable agreement with the 
large values frequently observed in dwarf galaxies, as illustrated in 
Extended Data Fig. 2. 

We also check the dependence of our results on the gas scale-length. 
In Fig. 2, the thin lines show the results for our simulations with a 
1-kpc gas scale-length. We find that the more compact galaxies have 
higher star formation rates and shorter depletion times than do less 
compact galaxies, which explains the wide range of depletion times 
visible in the observational data. However, even for the 1-kpc simula- 
tions we find that the star formation rate is virtually unaffected when 
supernovae are disabled while keeping photoelectric heating active. By 
contrast, disabling photoelectric feedback causes the depletion time to 
decrease by an order of magnitude, to values that are inconsistent with 
the observed sample. This indicates that photoelectric heating, not 
supernovae, regulates star formation over a wide range of gas surface 
density in dwarf galaxies, and not just in the potentially extreme”, 
nearly starless dwarf galaxies. 

The means by which photoelectric heating suppresses star formation 
in our simulations is simple and intuitive. When a new star cluster is 
formed in the simulation, its most massive stars will emit photons 
with energies of 8-13.6 eV. These photons dominate the heating rate, 
owing to the grain photoelectric effect, because they have energies 
high enough to liberate electrons from dust grains, but low enough 
not to be absorbed by intervening neutral hydrogen (Fig. 3). The heat- 
ing rate in the vicinity of a newly formed star increases the equilib- 
rium temperature of the gas at fixed pressure or density. This in turn 
increases the Jeans mass of the gas and makes star formation more 
difficult (Fig. 4). 

We conclude that the physics responsible for setting the instantane- 
ous star formation law in dwarf galaxies—that is, the depletion time— 
is nearly independent of the physics determining the properties of 
the outflows. Ultimately young stars are responsible for photoelectric 
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heating and supernovae, but it is the moderate, local, instantaneous, 
volumetric heating of the former that controls the depletion time. 
Supernovae, despite their ability to eject mass from the galaxy at a 
rate larger than the star formation rate, are unable to shut down star 
formation locally and instantaneously. In the long run, even though 
dwarf galaxies have 10-100 Gyr depletion times, large mass loading 
factors mean that the timescale on which gas is lost from the inter- 
stellar medium is shorter—between 1 Gyr and 10 Gyr—implying that 
these galaxies may be in equilibrium between gas inflow and star for- 
mation plus outflows**-?’. Therefore, the star formation rate in these 
galaxies would ultimately be set by the value of the mass loading factor, 
whereas the mass of gas in the interstellar medium would be set by the 
depletion time. In turn, the mass loading factor is set by supernovae 
and the depletion time by photoelectric heating. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Our simulations follow the evolution of an isolated dwarf galaxy for approximately 
100 Myr using the adaptive mesh refinement code Enzo'®. We use a piecewise- 
parabolic mesh hydrodynamics solver, with an HLLC Riemann solver to follow the 
motion of the gas. The gravitational potential is computed on the same mesh used 
to solve the hydrodynamics. Dark matter and stars are included as collisionless 
particles acting independently of the hydrodynamics, except when new particles 
are formed or existing particles inject mass, metals and energy back into the gas 
according to our feedback model. 

We use the same sort of initialization as in previous work**, although with 
parameters appropriate for a dwarf galaxy. In particular, we create a set of stellar 
and dark matter particles with the MakeGalaxy code”. The density ofa stellar disk 
and bulge are pre-specified, and their velocities are set so that the galaxy begins in 
approximate Jeans equilibrium. The dark matter has an initial Hernquist profile. 
We use a halo mass of 10!°Mo, a concentration of 10, a spin parameter of 0.04, a 
stellar mass of 10’Mo anda stellar scale-length of 300 pc. 

We initialize the density of the gas disk as a function of cylindrical radius rand 
height z according to 


a (1) 


as 
pgltsZ) pue| Joo 

"4 
where / is a normalizing constant chosen to set the initial gas mass of the disk, 
ra is the scale-length and H is the scale-height. Equation (1) applies until the 
putative disk pressure paTa (where Ty is the temperature of the disk) falls below 
the halo pressure pT), at which point the density and temperature of each cell are 
set to those of the halo, p, and Th, respectively. Although, pq is spatially dependent, 
Pn, Ty and Ty, are all taken to be constant. In these simulations we set T;, = 10°K, 
Ty=1,000K, pp =2.34 x 10-3°gcm 3 and po = 1.41 x 10-4 gcm *. The scale- 
length and scale-height of the disk are r;=5 kpc for most simulations and 
H=30 pc, respectively. A number of simulations in the main text use ra= 1 kpc. 
These initial conditions are chosen to minimize the time the disk spends in its 
initial collapse phase and the influence of the galactic halo on the dynamics. In 
particular, at this temperature the halo does not monolithically cool onto the 
galaxy over the timescale on which the simulation is run. 

These initial conditions are evolved under the influence of hydrodynamics, 

gravity and cooling, with additional subgrid prescriptions for the creation of 
new star particles and feedback from young stars, specifically stellar winds, type 
II supernovae and a rough treatment of heating from photoionization. We also 
include a prescription for self-consistently calculating the volumetric heating rate 
from the grain photoelectric effect. 
Feedback. Our feedback prescription is designed to avoid the ad hoc modifications 
to the physics that are typically necessary to produce feedback strong enough to 
affect the properties of the galaxy. Our goal is to show that none of these modifi- 
cations is necessary to have effective feedback at sufficiently high resolution. The 
basic physical picture is that once the adiabatic radius of the supernovae can be 
resolved, they will expand in a somewhat realistic way, rather than immediately 
losing all their energy to the cell, as in the famous overcooling problem”. 

When each star particle is formed in the simulation with initial mass M,, we draw 
a number from a Poisson distribution to determine how many type II supernovae 
will be produced by the particle. The rate parameter of this Poisson distribution is 
taken to be \=1,104M,/ (10°Mo), where the prefactor is the number of supernovae 
produced over the lifetime of a 10°Mo stellar population formed in a burst of star 
formation according to Starburst99 (ref. 30) using a Chabrier initial mass function*”. 

For each supernova produced, we next draw a number from the delay-time 
distribution of type II supernovae, again determined from the output of 
Starburst99. This is essentially just the convolution of the lifetime of massive stars 
with the initial mass function. At the hydrodynamical time step during which the 
supernova explodes, we add 10°! erg to the internal energy of the cell in which 
the supernova resides. The mass of ejecta and its metallicity are also determined 
from the Starburst99 output, and are fit to piecewise-polynomial functions for use 
in Enzo. To ensure that the supernova goes off on the highest level of refinement 
in the simulation, we make the star particles ‘must refine’ if the particle has any 
supernovae remaining in its future. This means that the cells containing such 
particles are marked for refinement to the highest resolution level, guaranteeing 
that they, and at least the surrounding two cells in each direction, will be on the 
highest resolution level. 

In addition to the energy from the supernovae added to the cell at the appropri- 
ate time step, we also add energy to the cell before the supernova itself. From the 
delay time for each supernovae, we infer the mass of the star that will be exploding, 
from which we can estimate the total ionizing luminosity for the particle by adding 
up the contribution from every massive star that has yet to explode in that particle. 
Given an ambient density, that is, the density in the surrounding gas cell, we may 


then estimate the volume of gas that can be ionized by the calculated luminosity, 
as in the standard Strémgren analysis. If this volume is larger than the cell, and the 
cell has a temperature below 10* K, internal energy is added to the cell to set the 
temperature to 10*K. The cell is not modified by this photoionization prescription 
if its temperature is above 10‘ K. 

For cells with Strémgren volumes smaller than the cell volume, we compute 
the internal energy of the Stro6mgren sphere and compare it to the internal energy 
across the entire volume of the cell. If the whole cell has less internal energy than 
the Strémgren sphere alone, then we add internal energy to the cell to make the 
two equal; otherwise we do nothing. 

This treatment of photoionization feedback is extremely conservative. In the 
limit of small cell sizes or low densities, we may vastly underestimate the size of 
the Hr region, because only one cell will be at 104 K. In the limit of high density 
or large cells, the total energy being injected into the gas is also conservative, in 
that the energy of the cell is changed only if the entire internal energy of the cell is 
smaller than the internal energy associated with the Stromgren sphere on its own. 

In addition to photoionization, we include feedback from stellar winds. Once 
again we rely on Starburst99 to compute the wind luminosity, with several small 
modifications. The specific energy of the wind is taken from Starburst99 only when 
the particle will in the future experience a supernova from the death ofa star with a 
delay time less than 10 Myr. If no such supernovae will explode in the future, then 
the cell still loses mass to winds, but their specific energy is set to (10 kms" ')?. This 
bifurcation in wind temperatures takes place in the Universe as a result of a sharp 
change in opacity in the atmospheres of stars as a function of mass. 

One additional change we make to the specific energy of the wind occurs right 
at the beginning of the life of the star particle. During this time, the most massive 
stars in the population drive extremely hot winds, exceeding 10’ K. When the out- 
flowing winds dominate the material in a single cell, as sometimes happens in our 
simulations, this high temperature can substantially slow down the simulation and 
cause other numerical problems associated with a high density contrast. To ame- 
liorate this issue we cap the wind specific energy at 10°° K. We keep the total wind 
energy injected constant by slightly increasing the mass lost during this time. This 
changes the total mass returned by only a tiny amount because this phase is so short. 

Extended Data Fig. 1 shows that every supernova in the 10-pc-resolution sim- 
ulation with the 5-kpc initial scale-length explodes in a cell with a density below 
the critical density at which a supernova remnant would cool before expanding 
to be the size of a single cell in the simulation. We calculate this critical density by 
adopting the following value for the radius at which a supernova remnant exits the 
Sedov phase and enters the pressure-driven snowplough (PDS) phase™: 

2/7 
4 pc 
ny’ (Z/Zo)!!’ 

We take the energy of the supernova in units of 10°! erg to be Es; = 1, and set the 
mass fraction of elements heavier than helium to a value appropriate for the Sun, 
Z=Zg=0.02. In our initial conditions, the disk component has Z/Zg =0.1 and the 
halo has Z/Zg = 0.01, but for the purposes of calculating the critical density we use 
a higher value because supernovae produce enough metals that locally Z may be 
substantially higher than its initial value. By setting Rpps to the size of a single cell 
in the simulation, we can solve for the value of my (the number density of hydrogen 
atoms in units of cm~*) at which the PDS phase would be marginally resolved. 

By resolving this crucial piece of physics in these galaxies, we find that our 
results are relatively insensitive to the resolution at which we run the simulations. 
We compare the depletion time for each physical scenario, run at 10-pc, 5-pc 
and 2.5-pc resolution, in Extended Data Fig. 3, and we find that the results of the 
simulations tend to become independent of resolution after roughly 100 Myr of 
evolution. This is less clear for the runs that include supernovae, for which the 
2.5-pce simulations have not advanced as far as their lower-resolution counterparts; 
but even here there is reasonable agreement between the 10-pc and 5-pc runs. 
Photoelectric heating. Far-ultraviolet (FUV) photons from young stars liberate 
electrons from dust grains in the interstellar medium. This is the primary means 
by which the neutral atomic gas in the interstellar medium is heated in the Milky 
Way. To include it in our simulation, we assume the following proportionality: 


Reps = 


® x FeuvZny erg s-! cm-3 

The heating rate & from FUV photons is proportional to their flux Fryy, and the 
density of metals Zny. At low densities and high temperatures, there is an addi- 
tional dependence on the electron density and gas temperature, but these effects 
are negligible in the cold, dense gas where FUV heating is important for suppres- 
sion of star formation, so we omit that effect. We also do not include cosmic ray 
heating, because this is roughly an order of magnitude less important than FUV 
heating under the optically thin conditions that prevail in the low density, dust 
poor galaxies we are simulating. 
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We calculate the FUV flux in the simulation by taking the luminosity L for each 
star particle to be 


2 ‘ 
di j-oP th ify <3 


log oL(t7) = log, o[Mp/(10°Mo)] + s~* gti if3<t<10 (2) 
j=0G a 


0 otherwise 


where t; is the age of the star particle in units of 10’ yr, and M, is the mass of the 
star particle. L is in units of ergs~'. The coefficients of polynomials (p; and qj) are 
given in Extended Data Table 1. This expression is the result of integrating the 
output spectrum of a Starburst99 single-burst model for a cluster mass of 10°Mo 
(hence the pre-factor in equation (2)) over the range 8-13.6 eV at finely spaced time 
intervals out to 100 Myr. This function is somewhat sensitive to the initial mass 
function—recent indications of a bottom-light initial mass function*? in dwarfs 
would increase it by a factor of approximately two. 

The FUV flux is then simply Fryy =L(t)/(4nr°), where ris the three-dimensional 
distance from the centre of the gas cell in question to the star in question. This expres- 
sion for Fguy neglects any effects from self-shielding, which should be negligible in 
the galaxies we have simulated, given their low column densities and low metallicities. 
The total FUV flux at a given cell is the sum of this quantity over all stars in the sim- 
ulation. Ifa given star’s contribution to the FUV flux varies by less than 10% across a 
given grid (the computational element one step above cells in Enzo), then we approx- 
imate the contribution of that star as constant across the grid, to avoid doing the full 
order N by M computation (where N is the number of cells and M is the number of 
particles) in regions far from FUV-emitting particles. To compute &/ny, we scale 
®/ny, Feuy and Z to the known values of these quantities in the solar neighbourhood: 

© 9.519258 Hay 2 
ny s 0.0015859021 ergs"! cm? Zo 


The numerical constant in the denominator is the Habing™ estimate of the inten- 
sity of the interstellar radiation field in the solar neighbourhood, multiplied by c 
(the speed of light) to convert to a flux. The pre-factor of 8.5 x 10-8 is the photo- 
electric heating rate normalized to the Habing value for the solar neighbourhood 
from a radially dependent model of the interstellar medium of the Milky Way”. 
This quantity is computed for each cell in the simulation, and fed to the Grackle 
library, which computes the rate of change of the internal energy density!®*63”, An 
example of &/ny in the ‘PE only’ simulation is shown in Fig. 3. 

This method is a substantial improvement over many current implementations of 
the interstellar radiation field. Many simulations do not include this source of diffuse 
heating at all. Some include it as constant throughout the simulation volume**, some- 
times with a correction for self-shielding?, but without regard to the individual sources 
or time-dependence of the FUV photons. Other simulations explicitly approximate 
the radiative transfer of these photons**, while others go even further and explicitly 
model the dust particles via which this radiation interacts with the gas*°*“. Although 
the last two methods are better approximations to the interstellar radiation field than 
what we have implemented here, our simulations have a higher resolution. As a result, 
we resolve the Sedov—Taylor phase of the supernovae (Extended Data Fig. 1) and, 
hence, can definitively show whether supernova feedback or the interstellar radiation 
field is dominant in the regulation of star formation in dwarf galaxies. 

Star formation. The star formation prescription we use is similar to many com- 
monly used schemes. At each time step, the probability of forming a star in each 
cell is taken to be 


0 if excluded 


=). dt M. : 
p, min] 1, eg} —-——! | otherwise 
trp Mp 


where df is the simulation time step (at the refinement level in question), ef is the 
efficiency of star formation per free fall time, Mcen is the mass of gas in the com- 
putational cell, and the free fall time tg = ./3m/(32Gp). Cells are excluded if their 
mass is less than the Jeans mass in that cell or if the cell is not on the maximum 
refinement level. In other words, cells must have a density 


is yakpT 
NjGumy (Ax)? 

to form stars. Here y=5/3 is the ratio of specific heats, kg is the Boltzmann constant, 
jvis the mean molecular weight in units of the hydrogen mass my, Ax is the size of 
the cells on the maximum refinement level, Gis the gravitational constant and Nj =4 
is the number of cells by which we require that the Jeans length be resolved through- 
out the simulation to avoid artificial fragmentation“’. This criterion corresponds to 
a diagonal line in the density-temperature phase diagram above which stars are not 
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allowed to form (Extended Data Fig. 4). The value of p is such that the average star 
formation rate in the non-excluded cells follows a volumetric Schmidt Law: 
P= e¢rp/ te. This model has a few parameters that we must set, although we argue 
that we do not have a huge amount of freedom to change them. 

The efficiency per free fall time is constrained by observations to be within a 

factor of a few of 0.01 (refs 42, 43), so we simply adopt this central value. The mass 
of individual star particles M, should in principle be low enough to be irrelevant— 
this would have the advantage of sampling the star formation rate density very well 
and forming stars if and only if the cell were Jeans unstable. This introduces two 
numerical difficulties. The first is that the Jeans mass for the coldest gas in our 
simulations can be comparable to the mass of a single star. It would be problematic 
for our feedback recipe to form such low-mass stars, in that a 120Mo star or a 
supernova that ejects 10Mo of material could never exist within a 10Mg particle. 
Moreover, the number of star particles we would have to follow increases as M,, t, 
Each of these issues would be avoided by choosing a large particle mass. However, 
using a large mass increases the chance that the cell in which the star formed would 
have insufficient mass to supply all of the gas needed to form the particle. When 
this happens, to maintain global mass conservation, we gather the mass from neigh- 
bouring cells. Each cell supplying gas contributes the same fraction of its mass to 
the new particle. We have found that a particle mass of My = 50Mo constitutes a 
reasonable balance between keeping star formation local to one cell and keeping 
the supernova ejecta mass less than the mass of a single star particle. 
Comparison to observations. In Extended Data Fig. 2, we compare the star formation 
rate and depletion time of the simulations to those of dwarf galaxies from three different 
samples!4°4, We exclude four blue, compact dwarf galaxies from this sample, because 
these are known to be undergoing starbursts. We do not include estimates for the errors 
on each point. The statistical standard errors are of the order of 10%, but this underpre- 
dicts the true error, which is dominated by systematic uncertainty in distance (H1 mass 
and star formation rate) and the assumptions made in converting ultraviolet or Ha 
luminosities to star formation rates**. For the two smaller galaxies in ref. 20, no strict 
upper limits are given on the star formation rate; but, because they are not detected in 
the same Ha image in which a star formation rate of 4 x 10 *Mo yr_' was measured 
for the largest galaxy in the sample, we take this as a very conservative upper limit on 
the star formation rate for these two galaxies (S. Janowiecki, private communication). 
Data availability. The numerical experiments presented here were run with a fork 
of the Enzo code, available from https://bitbucket.org/jforbes/enzo-dev-jforbes, in 
particular, change set daed04d1e5e6. This altered version of Enzo also requires an 
altered version of the Grackle cooling library, available from https://bitbucket.org/ 
jforbes/grackle, in particular, change set 1243856. A subset of the raw data files is 
available from http://www.johncforbes.com/dwarfs.html. 
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Extended Data Figure 1 | Supernovae are well-resolved. The cumulative dashed line indicates the density at which a supernova remnant would 
distribution of the density of cells in which supernovae explode is shown radiate all of its energy before it expanded to the size of a single cell (10 pc) 
in red, and that in which stars form is shown in blue. The thin lines show in the simulation, assuming solar metallicity. Nearly every supernova in 
these distributions in different 10-Myr intervals, while the thick lines show __ the simulation occurs to the left of this line, indicating that the simulation 
the cumulative density function over the entire simulation. The vertical does not suffer from the overcooling problem. 
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Extended Data Figure 2 | Comparison to observations. a, b, Star 
formation properties (a, depletion time; b, star formation rate (SFR)) for 

a heterogeneous set'*?4 of galaxies are shown as a function of gas mass 
My. Upper and lower limits are indicated with arrows. Boxes representing 
the range of values covered by our simulations are over-plotted. The 
vertical range of the box is determined by the final snapshot for each 
simulation; the high (low) SFR extremum represents the 1-kpc (5-kpc) 
disk. No ‘SN only’ simulation was run for the 1-kpc case, so the blue box 
is not closed. Only simulations that include photoelectric heating agree 
with the depletion times observed for bulk of galaxies in the mass range 
we simulated. See Methods section ‘Comparison to observations’ for more 
details. 
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Extended Data Figure 3 | A resolution study. a-d, The depletion time depletion time when the measurement is carried out in a cylinder with 
of all 12 simulations with 5-kpc gas scale-length is plotted over time. The 1-kpc radius; the other lines use a 9-kpc radius. Regardless of the aperture, 
four feedback models are shown (a, ‘no feedback, grey; b, ‘PE only; red; the simulations quickly converge; differences between simulations 
c, ‘SN only; blue; d, ‘SN + PE} black), with the line style indicating the with factor-of-two differences in resolution are small compared to the 
result for different resolutions (see legend). The orange lines show the differences resulting from changing the physics. 
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Extended Data Table 1 | Parameters for a fit 


p, q 

0 415709926107 40.7875024388 

1 2.40501751872 -0.227682606645 
2 -9.19544984847 0.0078916423535 
3 10.5203892767 0 

4 -5.72637964222 0 

5 1.50479521662 0 

6 -0.153355377095 0 


Coefficients for the piecewise-polynomial fit to the FUV luminosity of a simple stellar population as a function of its age (Methods, equation (2)). 
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Solar-type dynamo behaviour in fully convective 


stars without a tachocline 


Nicholas J. Wright! & Jeremy J. Drake? 


In solar-type stars (with radiative cores and convective envelopes 
like our Sun), the magnetic field powers star spots, flares and other 
solar phenomena, as well as chromospheric and coronal emission 
at ultraviolet to X-ray wavelengths. The dynamo responsible for 
generating the field depends on the shearing of internal magnetic 
fields by differential rotation’. The shearing has long been thought 
to take place in a boundary layer known as the tachocline between 
the radiative core and the convective envelope’. Fully convective 
stars do not have a tachocline and their dynamo mechanism is 
expected to be very different‘, although its exact form and physical 
dependencies are not known. Here we report observations of four 
fully convective stars whose X-ray emission correlates with their 
rotation periods in the same way as in solar-type stars. As the 
X-ray activity-rotation relationship is a well-established proxy 
for the behaviour of the magnetic dynamo, these results imply 
that fully convective stars also operate a solar-type dynamo. The 
lack of a tachocline in fully convective stars therefore suggests 
that this is not a critical ingredient in the solar dynamo and 
supports models in which the dynamo originates throughout the 
convection zone. 

Stars across the Hertzsprung—Russell diagram are known to emit 
X-rays, with only a few exceptions. In main-sequence solar-type and 
low-mass stars, the X-rays arise from a magnetically confined plasma 
known asa corona that reaches temperatures of several million kelvin’. 
Coronal X-ray emission is ultimately powered by the dissipation of 
the magnetic fields generated by an interior magnetic dynamo. A 
close relation between the surface magnetic flux and X-ray radiance 
based on both solar and stellar observations that span several decades 
for both quantities® indicates that X-ray emission is a reliable proxy 
for magnetic activity. The dynamo is thought to be driven in part by 
differential rotation in the interior of the star’, which itself is generated 
by the action of the Coriolis force on the rotating convective envelope, 
but the detailed mechanism remains to be properly understood’. The 
relationship between stellar rotation and tracers of magnetic activity is 
therefore an important probe of the stellar dynamo. 

In solar-type stars, X-ray emission is observed to increase monoton- 
ically with increasing stellar rotational velocity for periods exceeding 
a few days*”. This relationship is often quantified in terms of the 
dependence of the ratio of stellar luminosity expended in X-rays to 
bolometric luminosity Lx/Lyo,, on the Rossby number, which is defined 
as Ro=P,,/T, the ratio of the stellar rotation period and the mass- 
dependent convective turnover time’. A recent study” using the largest 
available sample of 824 solar- and late-type stars fitted the rotation— 
activity relation as Lx/Lpo1= 5.3 x 10-°Ro 7”. 

This relationship has been observed in stars from late F-type through 
to early M-type, that is, those with radiative cores and convective 
envelopes. The interface layer between these two regions, named the 
tachocline, is believed to play an important role in the generation of 
the magnetic field*. Shear between the rigidly rotating core and the 
differential rotation of the convective envelope with latitude is thought 
to amplify and store the magnetic field’’, generating what is known as 


an a-( dynamo, named for the interplay between cyclonic eddies (the 
a effect) and the shearing of the field (the 2 effect). 

Others have argued that the latitudinal and radial gradients of the 
angular velocity in the convection zone may be sufficient for global 
dynamo action'”"3. In the Sun, the extremely strong levels of radial 
shear just beneath the surface are actually higher than in the tachocline, 
making it plausible that the solar dynamo is distributed across the 
convection zone rather than confined to the tachocline’”. 

Despite this hypothesis it has been widely accepted that magnetic 
structures in the convection zone would be disrupted by magnetic 
buoyancy or turbulent pumping, preventing large-scale magnetic 
fields from being established there!*. However, some recent three- 
dimensional magneto-hydrodynamic simulations without a tachocline 
have produced persistent magnetic wreaths in the convection zone’? 
and shown that it is possible to produce large-scale magnetic fields in 
stellar convection layers'®!”. 

For very fast rotators, the rotation-activity relationship has been 
found to break down, with X-ray luminosity reaching a saturation level 
of approximately Lx/Ly1 107°, independent of the spectral type’®. 
This saturation level is reached at a Rossby number of approximately 
0.13 + 0.02 (ref. 9; the error represents one standard deviation), 
corresponding to a rotation period that increases towards later spectral 
types, from 2 days for a star similar to the Sun to up to about 20 days for 
low-mass M dwarfs, and is also seen in both chromospheric emission 
and magnetic field measurements. It is unclear whether this is caused 
by a saturation of either the dynamo mechanism or the transport of 
the magnetic flux to the corona, a change in the type of dynamo at 
work within the star? or because coronal X-ray emission itself becomes 
insensitive to the strength of the magnetic field, the energy of which is 
then dissipated in other ways. 

Main-sequence stars later than spectral type M3-3.5 (M<0.4Mo, 
where M is the mass of the star and Mg is the mass of the Sun) are 
predicted to be fully convective and therefore do not possess a 
tachocline. If the tachocline is critical to the operation of a solar-type 
dynamo, fully convective stars should not be able to sustain such a 
dynamo. Instead, it is generally thought that they generate magnetic 
fields entirely by helical turbulence’. Nevertheless, observations 
indicate that stars throughout the M-type spectral range exhibit high 
magnetic field strengths’? and high fractional X-ray luminosities’. In 
fact, no discernible difference in magnetic activity properties has been 
identified on either side of the fully convective boundary. 

One problem with existing studies is that nearly all fully convective stars 
that have so far been studied have saturated levels of X-ray emission’. It is 
therefore unclear whether these stars all exhibit saturated levels of X-ray 
emission (possibly hinting at some facet of their dynamo mechanism), 
or whether slower rotators follow a more solar-like rotation—activity 
relationship. Studies of magnetic activity in slowly rotating fully convec- 
tive stars have so far been lacking, adding to the uncertainty about their 
dynamo mechanism. Despite this, fully convective stars are common 
and, given their spin-down times of a few billion years”, at least half of 
all such stars are expected to be slow rotators. 
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Figure 1 | Rotation-activity relationship diagram for partly and fully 
convective stars. Fractional X-ray luminosity, Lx/Lo, plotted against the 
Rossby number, Ro = P;o/T, for 824 partly (grey circles) and fully (dark 
red circles) convective stars from the most recent large compilation of 
stars with measured rotation periods and X-ray luminosities’. The best- 
fitting saturated (horizontal) and unsaturated (diagonal) rotation—activity 
relationships from that study are shown as black dashed lines. The four 
slowly rotating fully convective M dwarfs studied here are shown in light 
red (error bars indicate one standard deviation). The uncertainties for 

the other data points are not quantified, but will be comparable to the M 
dwarfs for the Rossby number and approximately twice as large for Lx/Lpol. 


Understanding the dynamo mechanism in these slowly rotating 
stars is important for various astrophysical problems, including the 
dynamo-driven angular momentum loss rate of low-mass stars, 
the particle and photon radiation environment of exoplanets and 
the notorious period gap in the cataclysmic variables. The lack of 
cataclysmic variables with periods in the 2-3h range is often attributed 
to a change in rotational spin-down for fully convective stars with 
diminished magnetic dynamos and stellar winds”!. Recent numerical 
simulations of stellar winds suggest that increasing the complexity 
of the surface magnetic morphology can also suppress angular 
momentum loss and spin-down without requiring any change in the 
total surface magnetic flux’. 

Four slowly rotating, fully convective M-type stars were observed 
by either NASA’s Chandra X-ray Observatory or the ROSAT satellite. 
These stars are of spectral type M4-5.5, well beyond the fully convective 
boundary (M3-3.5) and are therefore genuine fully convective stars. 
Figure 1 illustrates the traditional rotation—activity diagram showing 
the fractional X-ray luminosity as a function of the Rossby number for 
all of the stars from the most recent large-scale study of the rotation- 
activity relationship in solar-type and low-mass stars’. The positions of 
the four slowly rotating fully convective stars studied in this work are 
also shown, and are in excellent agreement with the rotation—activity 
relationship of partly convective stars. 

The observations presented here provide clear evidence for 
unsaturated X-ray emission in fully convective stars and quantification 
of their rotation-activity relationship. The results show that fully 
convective stars, at least when they have spun down sufficiently, 
operate a dynamo that exhibits a rotation-activity relationship that is 
indistinguishable from that of solar-type stars. As the dynamo action 
in fully convective stars is expected to be different from that in solar- 
type stars owing to their lack of a tachocline*, where the magnetic field 
is thought to be amplified by radial shear, this is a surprising finding. 

The most direct conclusion from these observations is that both 
partly and fully convective stars operate very similar rotation- 
dependent dynamos in which the tachocline is not a vital ingredient 
and differential rotation combined with the action of the Coriolis force 
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is sufficient’. This implies that current models for the solar dynamo that 
rely on the tachocline layer to amplify the magnetic field are incorrect, 
and lends weight to recent three-dimensional magneto-hydrodynamic 
simulations without a tachocline that produce large-scale magnetic 
fields entirely within the convective layers”*”*. Recent studies with 
mean-field dynamo models have also suggested that differential 
rotation in the convection zone may play a greater role in the generation 
of the magnetic field than does the tachocline”>”®, 

Another alternative possibility is that fully convective stars are able 
to generate a purely turbulent dynamo that exhibits a rotation—activity 
relationship that is similar to that in partly convective stars, but by 
a different mechanism that does not rely on a shear layer. Existing 
dynamo simulations for fully convective stars succeed in generating 
magnetic fields, but are unable to predict their behaviour as a function 
of the rotation rate!’. However, it seems unlikely that both partly and 
fully convective stars would have the same rotation-activity relationship 
(requiring both their dynamo efficiency and rotational dependence to 
behave in the same way) without their dynamo mechanisms sharing 
a major feature. 

A third possibility is that convection in the cores of fully convective 
stars could be magnetically suppressed”, leading to the existence of a 
solar-like tachocline, although some studies suggest that convection 
would not be completely halted, only made less efficient”®, Furthermore, 
the field strengths that are necessary for such a transition are 10-108 G 
(refs 28, 29), orders of magnitude larger than the fields thought to exist 
in the solar interior and at levels that simulations suggest are impossible 
to maintain*®. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

New observations. Two targets were chosen from the MEarth Transit survey” 
of fully convective stars: G184-31 and GJ 3253, of spectral types M4.5°! and M5* 
with rotation periods of 83.8 days (ref. 33) and 78.8 days (ref. 20), respectively. 

The targets were observed with the Advanced CCD Imaging Spectrometer 
(ACIS)*# on the Chandra X-ray Observatory*® using the ACIS-S (spectroscopic) 
CCD array. Observations were performed in ‘very faint mode and were placed on 
the back-illuminated S3 (the 3rd CCD chip in the ACIS-S spectroscopic array) chip 
(owing to its higher sensitivity to soft X-rays. The observations were performed on 
6 December 2012 and 25 September 2013 for G184-31 and GJ 3253 respectively, 
with exposure times of 8.0ks and 21.0ks. 

Observations were processed using the CIAO 4.5 software tools*® and the 
CALDB 4.5.8 calibration files following standard procedures. The two sources were 
clearly identified and detected with significances of 5.00 and 10.20 at their 
expected positions. Point-source extraction was performed using CIAO 4.5. The 
total number of net counts was measured to be 35*7 and 1241}? for G184-31 and 
GJ 3253, respectively. Light curves were constructed for both sources to search for 
high levels of variability that might inflate the quiescent flux level measured, but 
no significant variability was detected. 

Thermal plasma X-ray spectral models were fitted to the extracted spectra using 
XSPEC” version 12.6.0 and compared to APEC* single-temperature optically thin 
model spectra of an absorbed thermal plasma in collisional ionization equilibrium, 
allowing the plasma temperature (kgT, where kg is Boltzmann’s constant and T is 
temperature) and the hydrogen column density (Ny) to vary freely. A grid of initial 
thermal plasma temperatures covering the range kgT’=0.1-3.0 keV was used to 
prevent fitting to local minima. The model with the lowest Cash statistic’? was 
selected as the best fit for each source (the Cash statistic is an application of the 
likelihood ratio test that is suitable for low-signal data). The best-fitting thermal 
plasma temperatures were found to be kg T= 0.78 + 0.13 keV and 0.30 + 0.05 keV 
for G184-31 and GJ 3253, respectively, consistent with the values found for other 
M-type dwarf stars. The fitted hydrogen column density is consistent with no 
absorption, as expected for the proximity of these sources. 

Absorption-corrected broadband (0.5-8.0 keV) fluxes of Fx = (2.08 + 0.38) x 
10° erg/s/cm? and (4.35 + 0.41) x 10°“ erg/s/cm? were calculated from the 
model fits for G184-31 and GJ 3253 respectively. Combined with their known 
parallax distances*!” the fluxes were used to calculate X-ray luminosities in the 
ROSAT band (0.1-2.4 keV), for consistency and ease of comparison with previous 
studies’. Fractional X-ray luminosities were then calculated from the observed 
J-band magnitudes and the appropriate bolometric corrections”®. 

Literature data. We searched the literature for fully convective stars (spectral 
type M4 or later) with existing measured rotation periods and X-ray luminosities, 
excluding any with short rotation periods (P,ot < 20 days), which would place the 
object in the saturated regime of Fig. 1. Two stars were found that met our criteria: 
GJ 699 (Barnard’s star, M4V; ref. 41) and GJ 551 (Proxima Centauri, M5.5V; ref. 42) 
with rotation periods*’ and X-ray luminosity values“! existing in the literature. 
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Rossby numbers and fractional X-ray luminosities for these stars were calculated 
as for the two newly observed stars. 

We also uncovered a number of stars that are close to the convective boundary 
in the literature**“° (spectral types M3 or M3.5). The spectral types and colours? 
of these stars imply that they are partly convective and so they are shown in 
Fig. 1 as grey dots. 

Sample size. No statistical methods were used to predetermine sample size. 
Code availability. The CIAO code used to reduce the Chandra X-ray Observatory 
data are available at http://cxc.cfa.harvard.edu/ciao and the associated calibration 
database can be found at http://cxc.cfa.harvard.edu/caldb. The XSPEC code used to 
perform X-ray spectral fitting is available at https://heasarc.gsfc.nasa.gov/xanadu/ 
xspec. 
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Combinatorial design of textured mechanical 


metamaterials 


Corentin Coulais!’, Eial Teomy*, Koen de Reus!, Yair Shokef* & Martin van Hecke!? 


The structural complexity of metamaterials is limitless, but, in 
practice, most designs comprise periodic architectures that lead to 
materials with spatially homogeneous features’"'!. More advanced 
applications in soft robotics, prosthetics and wearable technology 
involve spatially textured mechanical functionality, which requires 
aperiodic architectures. However, a naive implementation of such 
structural complexity invariably leads to geometrical frustration 
(whereby local constraints cannot be satisfied everywhere), which 
prevents coherent operation and impedes functionality. Here we 
introduce a combinatorial strategy for the design of aperiodic, 
yet frustration-free, mechanical metamaterials that exhibit 
spatially textured functionalities. We implement this strategy 
using cubic building blocks—voxels—that deform anisotropically, 
a local stacking rule that allows cooperative shape changes by 
guaranteeing that deformed building blocks fit together as in a 
three-dimensional jigsaw puzzle, and three-dimensional printing. 
These aperiodic metamaterials exhibit long-range holographic 
order, whereby the two-dimensional pixelated surface texture 
dictates the three-dimensional interior voxel arrangement. They 
also act as programmable shape-shifters, morphing into spatially 
complex, but predictable and designable, shapes when uniaxially 
compressed. Finally, their mechanical response to compression by a 
textured surface reveals their ability to perform sensing and pattern 
analysis. Combinatorial design thus opens up a new avenue towards 
mechanical metamaterials with unusual order and machine-like 
functionalities. 

The architecture of a material is crucial for its properties and func- 
tionality. This connection between form and function is leveraged 
by mechanical metamaterials!-!°, whose patterned microstructures 
are designed to obtain unusual behaviours such as negative response 
parameters”, multistability>*"! or programmability®!”. For ordinary 
materials, aperiodic architectures and structural complexity are asso- 
ciated with geometric frustration!®!”, which prevents a coherent and 
predictable response. Frustration hinders functionality, and metama- 
terial designs have therefore focused on periodic structures. However, 
the high level of control provided by 3D printing provokes the question 
of whether one can design and create structurally complex, yet frustra- 
tion-free, metamaterials!*. 

We investigate this question using a combinatorial design strategy. 
We assemble 1-cm? flexible, cubic building blocks, or voxels, into a 
cubic lattice that then forms a metamaterial (Fig. 1a). These building 
blocks are anisotropic and have one soft mode of deformation aligned 
with their internal axis of orientation, resulting in elongated or flattened 
shapes that we refer to as ‘bricks’ with positive or negative polariza- 
tion (Fig. 1b). Generally, mechanical metamaterials with randomly 
orientated building blocks are frustrated, because it is impossible for 
all blocks to cooperatively deform according to their soft mode: the 
bricks do not fit. We call voxelated!? metamaterials that allow soft 
deformations, or, equivalently, in which all bricks fit, ‘compatible. A 
trivial example of a compatible configuration is a periodic stacking 


of alternately polarized, parallelly orientated bricks. Hence, a peri- 
odic metamaterial consisting of parallel blocks is expected to exhibit a 
collective and harmonious deformation mode. We realized such a 
metamaterial by a combination of 3D printing and moulding (see 
Methods). Uniaxial compression indeed triggers a collective pattern 
change’ in three dimensions and produces the expected staggered 
configuration in which each brick is adjacent to six bricks of opposite 
polarization (Fig. 1c; Supplementary Video 1). 

We now consider the way in which to design aperiodic, yet frustration- 
free, mechanical metamaterials. Crucially, the internal structure of our 
blocks is anisotropic and each block can be oriented independently to 
allow structurally complex architectures. We think of these blocks as 
voxels, represent their orientation at each lattice point with a colour 
(Fig. 1d) and explore the discrete design space of such voxelated meta- 
materials. Compatibility requires two conditions to be met. First, pairs 


Frustration free 


Figure 1 | Voxelated mechanical metamaterials. a, A flexible anisotropic 
building block in its undeformed state. b, Corresponding flattened (—) 
and elongated (+) deformed bricks. c, A 5 x 5 x 5 metacube consisting 
of parallel blocks (left) shows a collective deformation under uniaxial 
compression (right). d, Bricks (top) and their schematic representation 
(bottom), with the colour indicating orientation, and black dents and 
white protrusions representing deformations. e, Two examples where 
appropriately polarized, adjacent bricks fit together. f, Periodic (left), 
complex (middle) and frustrated (right) 2 x 2 x 2 stackings—for the 
frustrated stacking, no consistent brick configurations exist (grey/ 
dashed). Schematic symbols are separated from one another for ease of 
visualization. 
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Figure 2 | Combinatorial design. a, Mapping of bricks to internal spins 
and then to surface spins (from left to right). b, Set of five pairs of motifs 
compatible with a given x texture, and examples of periodic (i, ii), quasi- 
periodic (iii; beginning of the Fibonacci sequence) and aperiodic (iv) motif 
stackings. c, Exact number of compatible L x L x L spin configurations 2 
(black circles) and lower and upper bounds (blue shaded region); 


of neighbouring bricks should exhibit closely matching shapes along 
their common face. Our building blocks are precisely designed such 
that, given the polarization of one brick, the polarization of an adjacent 
brick can be adapted so that the pair have a tight fit, irrespective of 
their mutual orientation. Hence, we need to track only the outward or 
inward deformations of the surfaces of the building blocks (Fig. le). 
The second compatibility condition concerns the combinatorics 
of the voxel arrangement: all bricks should fit, such that protrusions 
and depressions of all neighbouring bricks are matched. In general, 
the first condition can be met by clever building-block design, whereas 
the second condition leads to a complex combinatorial 3D tiling 
problem. 

As we will show, although the compatibility condition is violated in 
most random configurations, which are consequently frustrated, our 
specific building blocks allow for many complex configurations where 
all protrusions and depressions match. These non-parallel, structurally 
complex, yet compatible, architectures combine the rich spatial texture 
of aperiodic materials with the predictability of ordered materials, and 
form the blueprint for aperiodic, frustration-free mechanical meta- 
materials (Fig. 1f). 

The design of complex architectures is simplified by mapping brick 
configurations to spin configurations that satisfy a so-called ‘ice rule?” 
and, as such, is reminiscent of tiling* and constraint-satisfaction?>”° 
problems. We identify each brick with a vertex, connected to neigh- 
bouring vertices by bonds that represent the common face between 
bricks (Fig. 2a). Dents and bumps map to inward or outward spins o;, 
Oy, 0, and by placing a single spin per bond, the first compatibility con- 
dition is trivially satisfied. The second condition maps to the ice rule, 
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see Methods and Supplementary Information. d, A 10 x 10 x 10 metacube 
reveals its precisely designed surface texture under uniaxial compression. 
Square surface pedestals added for visualization. Inset, the undeformed 
metacube. See Methods for experimental details and Supplementary 
Videos 2-4. e, Schematic representation of the deformations at all surfaces 
of the metacube in d. 


which stipulates that the six bonds of each vertex should correspond to 
a brick configuration, whereby the six bricks, x, x*, y", yt, z and zt, 
correspond to spin configurations (0, 7), 72) =(—++), (+——), 
(+—+), (—+-—), (++ —) and ( +), respectively. Evidently, each 
allowed spin configuration corresponds to a compatible brick stacking 
and corresponding voxel configuration. Conversely, each compatible 
voxel configuration corresponds to two spin configurations related by 
parity (spin flip)—a symmetry that originates from the opposite polar- 
izations allowed by each building block. 

All compatible metamaterials thus obtained feature an unusual form 
of long-range order that relates the boundary to the bulk. Owing to the 
reflection symmetry of the bricks, spins along lines of bonds are alter- 
nating. Therefore, spins at opposing boundaries are equal (opposite) 
when their distance is odd (even). Because spins at the surface of a 
metamaterial represent its texture of bumps and dents, this property 
implies that textures at opposite faces of a metacube are directly linked. 
Moreover, once the surface texture is fixed, all internal spins and 
therefore bricks are determined (Fig. 2a). We call this unusual relation 
between surface and bulk ‘holographic order’ (see Methods). 

The combination of parity and holographic order implies that any 
motif consisting of n x p x q compatible blocks can be stacked in a 
space-filling manner, because the surface spins of such motifs have 
compatible textures. Moreover, once the x spins are fixed along a plane, 
we can determine a set containing all motifs A’, B,... with matching 
x spins and by parity obtain A’, B’, ... (Fig. 2b and Methods). These 
motifs can be stacked in an arbitrary order, as long as we alternate 
between ‘+’ and ‘“—’ motifs; this allows the straightforward design 
of periodic, quasi-periodic and aperiodic metamaterials (Fig. 2b). 
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Figure 3 | Pattern recognition and pattern analysis. a, Experimental 
realization of an elastic 5 x 5 x 5 metacube programmed for a ‘smiley’ 
texture. b, Schematic of experiments where the metacube in a is 
compressed between patterned clamps. The red and blue building blocks 
are oriented as in Fig 1; cylinders signify stubs that are positioned at sites 
where oS = 1 and red lines demarcate regions where 0X = of and 
o,=- on. c, Examples of mismatch between cube lock texture {oz} 
K 


(squares) and boundary key textures {o's 


} (circles). Red lines as in b. 


By removing building blocks at the boundary, complex shapes can be 
realized; but for simplicity we focus here on cubic metamaterials. 

Holographic order substantially restricts the number of potential 
compatible configurations: whereas for generic configurations their 
multitude is set by the volume, for compatible configurations it is set 
by the surface area. Moreover, many surface textures lead to forbid- 
den internal vertices such as where all spins are equal. For example, in 
general it is not possible to arbitrarily choose the surface texture at two 
faces simultaneously. Nevertheless, the number of distinct L x L x L 
(equivalent to n x p x q, above, with n= p= q :=L) spin configura- 
tions {2(L) is extremely large. To quantify the design limits and pos- 
sibilities, we exactly evaluated (2(L) up to L= 14 where (23 x 10%, 
and obtained strict and asymptotic lower and upper bounds (Fig. 2c, 
Methods and Supplementary Information). 

Despite the limitations imposed by compatibility, the design space 
of voxelated metamaterials is huge. To illustrate this, we constructed a 
general algorithm to obtain all L x L x 1 motifs that are compatible with 
a given texture {a} (see Methods and Supplementary Information); for 
each texture there are at least two distinct motifs. We now show that 
we can use this algorithm to design arbitrarily pixelated patterns of 
bumps and depressions, or textures, at a given surface of a metacube as 
a step towards arbitrary shape-morphing materials. In Fig. 2d we show 
arationally designed metacube created by 3D printing. Under uniaxial 
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d, Experimental force-compression curves. Colour corresponds to 

the key textures in c. e, The experimentally obtained stiffness k. varies 
systematically with both the area (A) and circumference (C) of the 
mismatch. Coloured data points correspond to the key textures in c, and 
the size of each circle represents k, (the yellow data point at A =C=0 
corresponds to very small k, and so is hard to see). f, The stiffness k, is 
essentially linear in A + C. Coloured data points as in e; small filled grey 
circles correspond to the open circles in e. 


compression (see Methods), the initially flat surface of this cube reveals 
its spatial texture, with the front and back related by holography (Fig. 2e, 
Methods and Supplementary Videos 2-4). This cooperative, com- 
plex, yet controlled, shape morphing illustrates that our combinatorial 
method allows for the rational design of shape-shifting metamaterials. 

We finally show that when aperiodic metacubes are compressed by 
patterned surfaces their response can be used for mechanical pattern 
analysis. We created a compatible 5 x 5 x 5 metacube, programmed 


with a ‘smiley’ texture +{ot} that acts as a ‘lock’ (see Methods), which 
is compressed between two identical surfaces that have a pixelated ‘key’ 
texture {or , created by placing 18 stubs in templated clamps (Fig. 3a, b). 
We characterize the difference between lock and key patterns by the 
area or number of misplaced stubs, A, as well as the circumference of 
the misplaced area, C (Fig. 3c), and use 136 different key patterns, 
which cover all possible values of A and C that can be reached by 18 
stubs. For each key, we performed experiments (simulations) to deter- 
mine the stiffness k, (k,) via the slope of the force-compression 
curves—the values agree very closely with one another (see Methods). 
When the key equals one of the two lock textures + oy , all bricks 
deform compatibly and the stiffness k is low. Incompatibly textured 
surfaces push metacubes into frustrated states, leading to an energy 
penalty and increased stiffness (Fig. 3d). The increase with A evidences 
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simple lock-and-key functionality, but when k is plotted as function of 
A and C, it is seen to increase with C also—for the same number of 
misplaced stubs, a range of stiffnesses can be observed (Fig. 3e). When 
plotted as function of A+C, all our data collapse onto a straight line, 
which evidences intricate collective phenomena (Fig. 3f). We suggest 
that, owing to parity, different parts of the cube deform in opposing 
parity, and that the stiffness is determined by the size of the domain 
walls separating these regions, which is given by A + C (see Methods). 
Together, these findings demonstrate the ability of a metacube to per- 
form an arithmetical calculation on the mismatch between key and lock 
patterns, in behaviour more readily associated with machines than with 
materials. 

Combinatorial strategies enable the design of machine-like materials 
that can be programmed with specific shape-sensing and shape-shifting 
tasks. We anticipate that combinatorial design of textured metama- 
terials can be extended in various directions. First, the inclusion of 
vacancies could lead to multishape materials”’, whereas defects can 
induce controlled frustration to obtain multistability, memory and 
programmability*'!!”. Second, differently shaped building blocks, such 
as triangles or hexagons in two dimensions and truncated octahedra or 
gyrobifastigii in three dimensions, or mixtures of building blocks could 
be used to tile space. Third, building blocks with degrees of freedom 
that are different from the simple ‘inwards or outwards’ deformations 
considered here could be considered—a prime example being ori- 
gami units that have folding motions!*"”. Finally, heating or magnetic 
fields instead of compression could be used to actuate shape-changing 
metamaterials, while non-mechanical textured functionalities such 
as wavefront shaping could also be achieved. We envision a range of 
applications for which control and processing of spatially complex 
mechanical information is key. Textured metamaterials can be designed 
to naturally interface with the complex shapes and shapeability of the 
human body, in prosthetics, haptic devices and wearables. Moreover, 
shape changing is central to a wide variety of actuators and sensors, in 
particular in the context of soft robots?87°, Finally, at smaller scales, 
controllable surface textures could control friction, wetting and drag. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Combinatorial design. The presence of holographic order reduces the number of 
potential compatible L x L x L spin configurations from 2 x 3} (L3 blocks with 3 


orientations and parity) to Be (3 pairs of opposing surfaces each with L? spins). 
The vast majority of these are not compatible and to better understand our design 
space, we discuss how to exactly evaluate and obtain upper and lower bounds for 
§X(L)—the number potentially compatible L x L x L spin configurations. We con- 
struct cubes by stacking motifs, and Q counts the number of L x L x 1 motifs 
compatible with a given texture {a}. 

To understand the possible motifs, we now classify the patterns of z bricks that 
arise in L x L x 1 motifs. Crucially, z bricks are sources or sinks for the in-plane 
spins 7, and oy, and therefore each 2 x 2 submotif can contain only 0, 2 or 4 
z bricks. This restricts the patterns of z bricks to columns, rows, and intersecting 
columns and rows. In general, we can enumerate the patterns of z bricks using 
binary column and row vectors with elements c; and rj, respectively, and placing a 
z brick at location (i, j) only when c; r; (Extended Data Fig. 1a). On the z-bricks, 
the z-spins form checkerboard patterns, whereas the spins in the remainder 
of the pattern can be chosen at will by filling each position with either an x or 
y brick. 

In the absence of z bricks, we can obtain two motifs by fixing 0, and a, to be 
opposite and alternating; that is, at site (i, j) of the motif, 0, = —a)=(— 1)'*J or 
Ox= —Oy=(— 1)'*/*1 Tn both cases, each vertex is compatible with either an x or 
ay brick, with corresponding positive or negative o,. This allows the straightfor- 
ward design of two L x L x 1 motifs consistent with any {o,}. In Extended Data 
Fig. 1b we show these motifs, as well as four more that are compatible with a5 x 5 
smiley texture; hence, Q=6 for this texture. In principle, (2(L) can be exactly 
evaluated by determining Q for each texture {a,}, and then summing over all 
textures (see Supplementary Information and Extended Data Fig. 1c): 
AL)=HD,, 

Lower bound. A lower bound for {2 follows from our construction to create two 
motifs for any spin configuration, which implies that Q > 2. As these can be stacked 
in an arbitrary order, this yields at least 2" spin configurations for a given texture. 
Because there are 2” o, textures, we find that Q(L) > 2” +4. 

Staggered spins. To simplify the counting of the number of compatible spin con- 
figurations {2(L) (for details see Supplementary Information), we define staggered 


spins &, such that for site (i, j, k) in the metacube glhh) =(- Ltt bik) , for 
a=x, y, z. Under this invertible transformation, a checkerboard texture of {o,}, 
for example, corresponds to a homogeneous texture of {&,} where all staggered 
spins are equal to either +1 or —1. Moreover, all sites in a given row, column or 
tube have the same value of &,, Gy Or G;, respectively. 

Upper bound. For a simple upper bound we note that the maximum value of Q is 
obtained when 6,=+1 or 6 =— 1. For each of these textures there are 
Q=2!+1_1 spin configurations. Consider, for example, 6, = + 1. If6, = — 1, then 
all the dy are free, leading to 2t compatible structures; if ¢ = — 1, then all the 
are free, leading to an additional 2' compatible structures. Because &, = G=-1 
was counted twice, the total number is 24 + 24 — 1 =24+!— 1. Hence, we obtain as 
an upper bound: Q(L) < QP (24+1_ 1)" <22/"+2 A stricter upper bound is derived 
in Supplementary Information: (2(L) < 4L x (3/4)! x 22”, The exact evaluation 
of §2(L) is explained in Supplementary Information and the results are given in 
Extended Data Table 1. 

Design limits. We note here that if n is the number of pairs of opposing surfaces 
where the spins can be chosen freely, then (2(L) ~ 2rl’, and that the simple lower 
and upper bounds given above roughly correspond to n=1 and n=2. 
Approximate calculations detailed in Supplementary Information lead to 
gl? +L+logy(3) < ) < 21?+2L+2 and our exact evaluation of 2(L) shows that, for 
large L, (2 is quite close to the lower bound (Fig. 2a). Hence, once the texture of 
one surface is fixed, there is limited freedom, apart from the stacking order of 
motifs, to design textures at other surfaces. For most spin textures, only the two 
simple motifs are compatible and z bricks play a minor role. 

Materials and fabrication. We created the 5 x 5 x 5 specimens by 3D print- 
ing water-soluble moulds, in which we cast a well-calibrated silicon rubber 
(PolyvinylSiloxane, Elite Double 32, Zhermarck, Young’s modulus, E= 1.32 MPa; 
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Poisson's ratio, v0.5). The unit bricks measure 11.46mm x 11.46mm x 11.46mm, 
with a spherical pore of diameter D= 10.92 mm in the centre and four cuboid 
inclusions of dimension 4.20 mm x 4.20mm x 11.46 mm at the x and y corners; 
see Extended Data Fig. 2a. They are stacked with a pitch of a= 11.46 mm, such that 
the filaments between the unit cells have a non-homogeneous cross-section with a 
minimal width d=0.54mm and a depth w= 3.6 mm; see Extended Data Fig. 2b. 

The 10 x 10 x 10 sample has the same brick dimensions and was 3D printed 

commercially (Materialise, Leuven, Belgium) out of sintered polyurethane 
(E= 14 MPa). On the faces of the aperiodic cubes, square pedestals were added 
to facilitate visualization of the surface texture and compression by textured 
boundaries. 
Mechanical tests. We compressed both metacubes at a rate of 0.02 mms! in 
a uniaxial testing device (Instron type 5965), which controls the compressive 
displacement u to better than 10,1m and measures the compressive force with an 
accuracy of 10 N at an acquisition rate of 0.5 Hz. 

Whereas we used flat plates for Fig. 1c, we used textured boundary condi- 
tions for Figs 2d and 3. We created textured top and bottom boundaries using 
aluminium plates with female 3-mm threads positioned in a square array of pitch 
a=11.46+0.02 mm, in which we mount stainless steel M3 screws whose caps 
were machined to a height of 3.50 + 0.01 mm (see Extended Data Fig. 3a); this 
ensures precise levelling of the pins and flexibility in texture. At the start of each 
experiment, the cubes were carefully positioned and aligned by hand within a 
1-mm accuracy on the bottom boundary. The screws were placed to form identical 
(complementary) top and bottom patterns for the 5 x 5 x 5 (10 x 10 x 10) cube. 

For the 10 x 10 x 10 cube, designed as in Extended Data Fig. 4a, we used check- 
erboard textures that led to the desired pattern on one face (Fig. 2), the reverted 
pattern on the opposite face (Fig. 2 and Extended Data Fig. 4b) and checkerboard 
textures on the other faces (Fig. 2 and Extended Data Fig. 4c). 

Numerical simulations. We probed the response of a5 x 5 x 5 aperiodic smiley 
metacube to different textures by performing a fully nonlinear analysis within 
the commercial package Abaqus/Standard. We modelled the elastomer using 
a neo-Hookean strain energy density with a Young’s modulus of E= 1.32 MPa 
and a Poisson’s ratio of v= 0.4999. We carried out a mesh optimization and a 
mesh density study leading to a typical mesh size of 0.6mm and a total number 
of 1.5 x 10° hybrid quadratic tetrahedral elements (Abaqus type C3D10H). We 
applied uniaxial compression by applying 10 steps of magnitude Au=0.25 mm, 
using exactly the same boundary conditions and dimensions as in the experiments 
(Extended Data Fig. 5b). 

Determination of k. The numerical force-displacement (F—u) curves are very 
well fitted by the quadratic form F(u) = ku+ nu?, which captures the effect of the 
nonlinear softening 77 and which allows an accurate estimation of the stiffness k. 
The experimental determination of the stiffness required special care, because small 
gravity-induced sagging of the cube causes a soft knee in the force-displacement 
curve when the top boundary makes contact with the sample. Therefore, we 
determined the stiffness by fitting the force-displacement curves to the same 
quadratic function as for the numerics, focusing on intermediate displacements 
0.8mm <u < 2.5mm away from the knee where dF/du is linear in u; see Extended 
Data Fig. 5a. 

Lock and key mechanics. For the lock-and-key experiments, we used a5 x 5 x 5 
cube made by stacking five B* configurations (Extended Data Fig. 1b). The key 
patterns consisted of 18 screws placed in a 5 x 5 array (see Fig. 3b, c and Extended 
Data Fig. 3a, b). We focused on 136 configurations that have distinct values of 
the area A and circumference C of the texture mismatch. For experiments and 
simulations, we estimated the stiffness k and observe that it increases with the 
mismatch between lock and key. The variation of k in experiments and simula- 
tions closely match (Extended Data Fig. 5c). Neither A nor C are good predictors 
for the level of frustration. To interpret the outcome of the experiments with the 
5x 5x5 cube, we posit that, for incompatible key textures, different parts of 
the cube approach compatible configurations with opposite parity, thus localizing 
the frustration along internal domain walls. Hence, a single misplaced pixel carries 
an energy penalty due to the four frustrated x and y sides of the brick in front of 
the defect, and one z side opposing the defect—when defects touch, their interface 
is not frustrated. Therefore, the size of the domain walls is equal to the number of 
frustrated sides, which is A + C. 
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Extended Data Figure 1 | Motif-based design. a, 2D representation of 
the six bricks (x7, x*, y-, y*, z, Z*), and illustration of complex motifs. 
All complex motifs can be generated by defining two binary vectors with 
elements c; (column) and 1; (row) that govern the placement of z bricks 
at location (i, j). The remaining sites are then filled with x and y bricks. 
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Respecting parity, this generates all motifs for given c and r. b, The six 
motifs (A*, ..., F*) that are compatible with a5 x 5 smiley texture. 

c, A total of 6° smiley metacubes can be designed by varying the stacking 
order; here A~ denotes the same motif as A* but with inversed spins. The 
x and y spins follow from the choice of motifs. 
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Extended Data Figure 2 | Implementation. a, b, Computer assisted design of the geometry of the unit cell (a) anda 5 x 5 x 5 cube (b). All our samples 
were 3D printed with the dimensions a= 11.46mm, D= 10.92 mm and w= 3.6 mm. To make the wall thickness outside the cube equal to the internal 
wall thickness, the outer walls are thickened by 0.27 mm. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


a 


Extended Data Figure 3 | Lock-and-key experiment. a, Picture of the textured clamp. b, Side view of the experiment. 
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Extended Data Figure 4 | 10 x 10 x 10 metacube under uniaxial compression. a, Motif A*—the cube is designed by stacking motifs A* and A~. 
b, Opposite face of the one shown in Fig. le showing the inverted pattern. c, One of the transverse faces showing a checkerboard pattern. 
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“1.0 1.3 1.6 
u (mm) u (mm) ks (N/mm) 
Extended Data Figure 5 | Complex sensory properties of a complex fits with a quadratic function F(u) = ku + nu’ performed in the shaded 
5 x 5 x 5 metacube with internal smiley texture. a, Force-compression region 0.8mm <u <2.5 mm. b, Corresponding numerical results. 
(F-u) curves for five experiments (thick solid lines), where the colour c, Scatter plot showing very good correspondence between the stiffness 
indicates the external texture shown in Fig. 3c. The black thin lines show obtained from simulations (k,) and experiments (k.). 
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Extended Data Table 1 | The exact value of 92 for L x L x L metacubes up to L=14 


L Q 
1 6 
2 450 
3 151,206 
4 145,456,074 
5 325, 148,366,166 
6 1,562,036,085,226,890 
7 17,234,732,991 509,112,246 
8 578,304,084,367,752,824,053,674 
9 84,438,573,424,284,282,414,882,546,966 
10 58,592,971 ,553,875,504,020,753,814,442,326,410 
11 181,442,224 ,689,012,470,542,563,031 429,841 423,983,926 
12 2,404,888,026,041,008,595,056,652,999,310,606,919,098,996, 796,074 
13 126,725,905,761 644,879,286, 362,510,660,061,876,041,719,518,257,045,613,846 


14 30,625,852,190,216,495,511,364,347,343,665,021 ,261,262,812,628,299,779,541,749,100,810 
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Western boundary currents regulated by interaction 
between ocean eddies and the atmosphere 


Xiaohui Ma!, Zhao Jing!?, Ping Chang!*, Xue Liu!?, Raffaele Montuoro’, R. Justin Small*, Frank O. Bryan’, 
Richard J. Greatbatch*.°, Peter Brandt”, Dexing Wu!, Xiaopei Lin! & Lixin Wu! 


Current climate models systematically underestimate the strength 
of oceanic fronts associated with strong western boundary currents, 
such as the Kuroshio and Gulf Stream Extensions, and have difficulty 
simulating their positions at the mid-latitude ocean’s western 
boundaries’. Even with an enhanced grid resolution to resolve 
ocean mesoscale eddies—energetic circulations with horizontal 
scales of about a hundred kilometres that strongly interact with the 
fronts and currents—the bias problem can still persist”; to improve 
climate models we need a better understanding of the dynamics 
governing these oceanic frontal regimes. Yet prevailing theories 
about the western boundary fronts are based on ocean internal 
dynamics without taking into consideration the intense air-sea 
feedbacks in these oceanic frontal regions. Here, by focusing on the 
Kuroshio Extension Jet east of Japan as the direct continuation of the 
Kuroshio, we show that feedback between ocean mesoscale eddies 
and the atmosphere (OME-A) is fundamental to the dynamics and 
control of these energetic currents. Suppressing OME-A feedback in 
eddy-resolving coupled climate model simulations results in a 20-40 
per cent weakening in the Kuroshio Extension Jet. This is because 
OME-A feedback dominates eddy potential energy destruction, 
which dissipates more than 70 per cent of the eddy potential energy 
extracted from the Kuroshio Extension Jet. The absence of OME-A 
feedback inevitably leads to a reduction in eddy potential energy 
production in order to balance the energy budget, which results in a 
weakened mean current. The finding has important implications for 
improving climate models’ representation of major oceanic fronts, 
which are essential components in the simulation and prediction of 
extratropical storms and other extreme events*, as well as in the 
projection of the effect on these events of climate change. 

With the availability of high-resolution satellite observations and 
climate model simulations, it has become increasingly evident that 
ocean mesoscale eddies and fronts can have a profound influence 
on the atmospheric planetary boundary layer’~"". In critical regions 
where prominent oceanic fronts reside, such as the Kuroshio and Gulf 
Stream Extensions, studies have demonstrated that the strong coupling 
between the atmosphere and ocean at meso- and frontal scales can 
be vital in anchoring atmospheric storm tracks**’”!? and providing 
a source of influence on mid-latitude climate variability and weather 
pattern changes**"4. Therefore, it is imperative that we understand the 
fundamental dynamics for maintaining oceanic frontal systems and 
improve climate models’ ability to simulate and predict these important 
oceanic features. 

In classic ocean circulation theory, the Kuroshio Current and its 
extension, like other mid-latitude western boundary current regimes, 
are explained as a part of ocean circulation gyres maintained by the 
large-scale wind stress acting on the ocean’s surface. Through the 
action of planetary waves, the energy input by the winds is transported 
westward and accumulated along the western boundary, giving rise to 


the intense western boundary current—the Kuroshio, which swings 
to the east from Japan to form the energetic Kuroshio Extension Jet 
(KEJ) along the boundary between the North Pacific subtropical 
and subpolar gyres. The KE] is so energetic that it is hydrodynami- 
cally unstable, allowing vigorous ocean eddies to form in its vicinity 
by extracting energy from the mean current, which in turn can feed 
back onto the mean jet >-17, In this classic description, the effect of 
OME-A interaction in the Kuroshio Extension region is not taken into 
consideration. However, recent high-resolution satellite observations 
and coupled atmosphere-ocean model simulations indicate a large 
potential energy release from mesoscale oceanic fronts and eddies to 
the overlying atmosphere. Figure 1 shows the relationship between 
mesoscale sea surface temperatures (SSTs) and surface turbulent 
heat fluxes (THFs) over the Kuroshio Extension region, derived from 
high-resolution satellite observed SST and reanalysis surface THFs. 
Various high-resolution reanalysis products reveal a remarkably 
consistent SST-THF relationship, statistically significant with 
P value less than 0.01, which gives an approximately 40-56 W m7 
THF increase (into the atmosphere) per 1°C SST increase. This 
relationship is well reproduced by an eddy-resolving Community Earth 
System Model (CESM) simulation developed and conducted by the 
National Center for Atmospheric Research (see Methods). It is also 
well reproduced by an ensemble of higher-resolution Coupled Regional 
Climate Model (CRCM) simulations developed and performed at Texas 
A&M University (see Methods), except that the sensitivity of THF to 
SST change is at a higher rate of 56 W m~*°C~*. Whether this strong 
OME-A feedback will have an impact on the KEJ and the associated 
oceanic front is the focus of this study. 

To examine this question, we conduct a set of twin simulations 
using both global and regional climate models (CESM and CRCM). 
The twin simulations consist of a standard high-resolution model 
simulation (the control simulation or CTRL) and an otherwise identical 
simulation except that the influence of mesoscale SST variability on 
the atmosphere is suppressed (hereafter referred to as the Mesoscale 
Eddy Filtered Simulation or MEFS). The suppression of mesoscale 
SST influence in MEFS is achieved by spatially low-pass-filtering the 
simulated SST that is given to the atmospheric model at each coupling 
step, so that the atmosphere does not ‘feel’ ocean eddies even though 
they are simulated by the ocean model (see Methods). As such, OME-A 
feedback is greatly reduced in MEFS and a comparison between CTRL 
and MEFS allows us to assess the impact of OME-A feedback on the 
KEJ. The conceptual design of the experiments is similar to a previous 
study of the effects of the eddies on the Gulf Stream!8, except that here 
a fully coupled—instead of an ocean-only—modelling framework 
is employed to allow more explicit and accurate simulations of the 
OME-A feedback. 

Figure 2a and b displays the vertically averaged mean kinetic energy 
of the upper 250 m of the ocean from the 11-year global CESM CTRL 
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Figure 1 | Mesoscale surface THF and SST relationship. a~d, Composites 
of mesoscale-eddy-induced SST (contour) and THF (colour scale). 

Shown are winter season (ONDJFM) mean anticyclonic warm eddy 

minus cyclonic cold eddy composites in the Kuroshio Extension region 
(30° N-50° N, 140° E-180° E). The white circle marks one eddy radius (R) 
and the white point indicates the eddy centre. Daily surface heat fluxes 
from the European Centre for Medium-Range Weather Forecasts Year of 
Tropical Convection (ECMWF-YOTC, 2008-2010) and from the National 
Centers for Environmental Prediction—-Climate Forecast System Reanalysis 
(NCEP-CFSR, 2002-2010) are used in a and b, respectively, together with 
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and MEFS runs over the North Pacific. It is evident that the suppression 
of OME-A feedback leads to a more meandering and weaker KE] 
(Methods and Extended Data Fig. 1). The difference between the mean 
kinetic energy in CTRL and MEFS clearly shows that over the entire 
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satellite microwave-infrared (MWIR) SST over the overlapping periods. 
Simulated winter daily mean SST and THF from CESM CTRL and CRCM 
CTRL are used in c and d, respectively. The same eddy tracking algorithm 
was applied to the model simulated and observed sea surface height. 

See Methods for details of eddy tracking, composite and data sets used. 
e-h, Scatter plot and linear regression (black lines) between high-pass 
filtered weekly SST and THF in the same region as for the ECMWF-YOTC 
(e), NCEP-CFSR (f), CESM CTRL (g) and CRCM CTRL (h). r’ represents 
the explained THF variance and s is the regression coefficient. The P values 
for all the regression coefficients are below 0.01, on the basis of an F-test. 


North Pacific Ocean it is the KEJ that is most affected by the change in 
OME-A feedback and the more meandering KEJ in MEFS appears to 
shift northward (Fig. 2c). The more meandering and northward-shifted 
KEJ is consistent with the observations that the KEJ axis tends to move 
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Figure 2 | Kuroshio response to OME-A feedback simulated by eddy- 
resolving global and regional climate models. a-c, Vertically averaged 
(upper 250 m) winter-season mean kinetic energy (colour scale) simulated 
in global CESM CTRL (a), MEFS (b) and MEFS minus CTRL (c) based 

on the winter-season (ONDJFM) mean of 11-year CESM simulations. 

d-f, Vertical section of zonally averaged (145° E-160° E) (indicated by the 
dashed vertical lines in a) winter-season mean KEJ velocity U (colour scale 
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and contours in metres per second) across the Kuroshio Extension Front 
based on first-year CESM simulations. The x axis is the distance from the 
axis of KEJ at 0, which is defined by the latitude of the winter-season mean 
maximum U, and the y axis is depth. g-i, As for d-f, but for the ensemble 
mean of zonally averaged winter-season mean U velocity in regional 
CRCM simulations. 
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northward when the KEJ resides in an unstable (more meandering) 
regime'®. Furthermore, the KEJ change occurs within a remarkably 
short time period, which is clearly illustrated by the sharp contrast 
between the zonal mean KEJ averaged over the first year of the CESM 
CTRL and MEFS runs: the KEJ in MEFS is considerably weaker 
and broader than that in CTRL (Fig. 2d-f), similar to the long-term 
averaged KE) change (Extended Data Fig. 1). The rapid KE] response to 
OME-A feedback changes is corroborated by the ensemble of regional 
CRCM runs that show a very similar weakening in the KE) after 
OME-A feedback is suppressed (Fig. 2g-i), indicating the robustness 
of the finding. Because oceanic adjustment timescales to large-scale 
wind forcing through planetary waves are on the order of several years 
to decades at mid-latitudes””, it is unlikely that this rapid KEJ response 
is caused by large-scale atmospheric circulation changes (Methods and 
Extended Data Fig. 2). 

A further probe into the model response to OME-A feedback reveals 
that the weakened KE] is accompanied with a substantial increase in 
mesoscale eddy energy when OME-A feedback is suppressed in MEFS. 
Figure 3 shows the ratio (MEFS/CTRL) between wavenumber spectra 
of eddy kinetic energy (EKE) derived from the suppressed-feedback 
and full-feedback simulations in the Kuroshio Extension region, as 
well as the ratio between wavenumber spectra of enstrophy (half of 
the vorticity variance), for both global CESM and regional CRCM 
experiments. Both model experiments show an increase in EKE and 
enstrophy for wavelengths less than about 500 km when comparing 
MEFS to CTRL, but the largest increase occurs at wavelengths less than 
100 km (Fig. 3). In particular, the global CESM simulations exhibit 
a nearly 30% EKE increase and a 50% enstrophy increase at this 
mesoscale eddy range, while the increases in the regional CRCM twin 
ensembles are slightly smaller, probably owing to the short integration 
period. The energized mesoscale eddy activity implies a change 
in eddy energy balance; we therefore asked whether the KEJ response 
to OME-A feedback change can be explained in terms of the change in 
mesoscale eddy energetics. 

To shed light on this question, we perform an ocean eddy potential 
energy (EPE) analysis using the twin ensembles of regional CRCM 
simulations (see Methods and Extended Data Fig. 3 for details). The 
rate of EPE change is determined by the interplay between energy 
production and destruction processes (see equation (1) in Methods). 
The production is primarily achieved through converting the mean 
available potential energy (MAPE) stored in the mean flow to EPE 
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Figure 3 | Simulated energy and enstrophy spectra in the Kuroshio 

Extension region. a, c, Ratio (MEFS/CTRL) of winter-season (ONDJEM) 

mean EKE spectra in the Kuroshio Extension region (145° E-160° E, 

30° N-42° N) based on the 11-year CESM simulations and the twin 

ensembles of CRCM simulations, respectively. b, d, As for a and c, but for 

eddy enstrophy (ENS; half of the vorticity variance) spectra. Eddy velocity 
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(hereafter referred to as MAPE-to-EPE conversion). The destruction 
consists of two processes: one converts EPE to EKE to energize eddy 
circulation (hereafter referred to as EPE-to-EKE conversion) and 
the other acts to dissipate EPE through either oceanic mixing or 
exporting eddy energy to the atmosphere (hereafter referred to as EPE 
dissipation)'*. Thus, it is the EPE dissipation that is strongly affected 
by OME-A feedback. 

To quantify the extent to which OME-A feedback can affect the 
EPE dissipation, we analyse and compare the vertically integrated 
EPE budget over the upper 250 m of the ocean in the twin ensembles 
of regional CRCM simulations (inset tables in Extended Data Fig. 3). 
With full OME-A feedback, CRCM CTRL reveals that nearly 74% of 
the EPE extracted from MAPE is lost owing to EPE dissipation and 
less than 22% is converted to EKE (the remaining 4% is accounted for 
by the EPE change during the integration period). This result clearly 
indicates that in the presence of OME-A feedback, nearly three- 
quarters of the Kuroshio EPE production is dissipated and only a small 
portion (less than a quarter) is used to drive eddy circulation. In sharp 
contrast, when OME-A feedback is suppressed, CRCM MEFS shows 
that the roles of EPE dissipation and of EPE-to-EKE conversion in 
the EPE budget are reversed: more EPE is being converted to EKE 
(about 47%) than being dissipated (about 40%). The increase in EPE- 
to-EKE conversion is consistent with the enhanced mesoscale eddy 
activity in MEFS as shown in Fig. 3. Thus, it is very clear that OME-A 
feedback is essential for dissipating ocean EPE and maintaining the 
EPE budget. 

The change in EPE dissipation between the twin ensembles of 
regional CRCM simulations provides the key as to why the simulated 
KEJ behaves differently with and without OME-A feedback. With the 
full feedback, the Kuroshio system experiences strong dissipation. To 
generate and maintain an active eddy field, the KEJ must be sufficiently 
energetic to allow strong conversion from MAPE to EPE in order to 
overcome the vigorous EPE dissipation owing to OME-A feedback. 
With the suppressed OME-A feedback, EPE dissipation is greatly 
reduced, leaving the Kuroshio system in a less dissipative regime. In 
such a regime, more EPE is being converted to EKE than is being 
dissipated to the atmosphere, leading to a more energetic eddy field 
(Fig. 3) and a more meandering and weaker KE] (Fig. 2 and Extended 
Data Fig. 4). A schematic summary of the proposed mechanism is 
shown in Fig. 4, which is supported by a detailed EPE budget analysis 
(Methods and Extended Data Fig. 3). 
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is defined as the departure from the zonal mean between 145° E and 

160° E. The shading indicates the geometric standard deviation of the 
EKE and the enstrophy ratio in individual years of the CESM simulations 
(a and b) and of CRCM simulations (c and d). The vertical dashed line 
represents eddies with 100-km wavelength. The horizontal dashed line 
indicates the MEFS/CTRL ratio of unity. 
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Figure 4 | Schematic of OME-A feedback regulating the KEJ. The 
Kuroshio Current swings to the east from Japan to form the energetic 
KE] (yellow dashed contours), flowing eastward underneath the Pacific 
Jet Stream. The KEJ is hydrodynamically unstable, shedding eddies by 
converting KEJ’s MAPE to EPE (white arrows). Energetic anticyclonic 
warm eddies (red circles) release THFs to the atmosphere (spiral arrows), 
and energetic cyclonic cold eddies (peach circles) absorb THFs from the 
atmosphere (spiral arrows), interacting with the overlying atmospheric 
planetary boundary layer and causing intense OME-A feedback. The 


These results from the high-resolution coupled atmosphere- 
ocean model experiments offer a new perspective on the dynamics 
controlling the Kuroshio Extension Front and Jet, revealing that local 
OME-A feedback needs to be taken into consideration by classic ocean 
circulation theory. This insight into the complex dynamics governing 
Kuroshio variability has important implications for the improvement of 
climate models in simulating the Kuroshio Extension Front and eddies 
as well as their impact on climate. Recent studies have shown that the 
sharp SST gradient and the eddy-induced mesoscale SST variability 
along the Kuroshio Extension Front can have an impact on the Pacific 
storm track>®}, increasing the likelihood that an improved simulation 
of Kuroshio eddies and their feedback with the atmosphere in climate 
models may lead to improvements in short-term prediction of 
extratropical weather patterns and long-term projection of the 
storm track response to future climate change. Although the focus of 
this study is on the KE], the findings of this study are applicable 
to other oceanic fronts, such as the Gulf Stream front (Methods 
and Extended Data Fig. 5) and probably also Southern Ocean fronts, 
where strong OME-A feedback exists!””-”. Not only are these oceanic 
frontal zones hotspots of the oceans where extratropical ocean 
influence on the atmosphere may be particularly strong”’, they 
are also the regions where present climate models have great 
difficulties in reproducing observations**”’. The results presented 
here may shed light on how to understand and reduce model biases in 
these critical regions and improve climate simulations and projection”. 
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feedback dominates EPE destruction (thicker black arrow) that dissipates 
74% of the EPE generated from the KEJ in CRCM CTRL, leaving only a 
small portion (about 22%) to drive eddy circulation through EPE-to-EKE 
conversion (thinner black arrow). With suppressed feedback in CRCM 
MEFS, EPE dissipation (EPD) drastically reduces to 40% while EPE-to- 
EKE conversion increases to 47%, causing a more meandering and weaker 
KEJ. The EPE cycles in CTRL (red) and MEFS (blue) are illustrated in the 
inset diagram and further discussed in Methods and Extended Data Fig. 3. 
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METHODS 


Model configuration and experiment design. The high-resolution eddy-resolving 
global climate simulations in this study are conducted by the National Center 
for Atmospheric Research using the Community Earth System Model (CESM). 
The model configuration includes the Community Atmosphere Model version 
5 (CAMS) with a spectral element dynamical core, the Parallel Ocean Program 
version 2 (POP2), as well as sea ice and land models”. CAM5 and POP2 have 
horizontal resolutions of about 0.25° and about 0.1°, respectively. The ocean model 
provides SST and surface current velocity to the atmosphere model and receives 
updated fluxes from the atmosphere model calculated using the Large and Yeager*! 
surface layer scheme every 6h (refs 2 and 32). CESM CTRL was integrated for 100 
years with 14 years of spin-up. Starting on 1 March of year 59, the twin MEFS was 
conducted with the same settings as CTRL except that a low-pass spatial filter was 
applied to the POP2 simulated SST before being passed to CAM5 at each coupling 
step to remove mesoscale SST variability. The ocean in turn was forced by the 
atmospheric surface fluxes, which contained no signature of the mesoscale SST. 
The low-pass filter used was a 1,000 km x 1,000km rectangular boxcar filter. MEFS 
was integrated for 11 years. The twin simulation comparison shown in this study is 
based on the 11-year CESM CTRL and MEFS simulation from year 59 to year 70. 
The Coupled Regional Climate Model (CRCM) was developed at Texas A&M 
University to perform simulations at high resolution. The model consists of a 
custom coupling software framework that allows frequent mass, momentum 
and energy exchanges at the interface between its ocean and atmosphere regional 
model components. CRCM can efficiently perform remapping of interface fields 
as well as spatial filtering of ocean surface quantities at runtime, and provides a 
set of different surface flux formulations. It includes the Weather Research and 
Forecasting*® (WRF) Model developed by the National Center for Atmospheric 
Research as the atmospheric component and the Regional Ocean Modelling 
Systems (ROMS) developed by Rutgers University and UCLA as the oceanic 
component. The CRCM is configured with a common Arakawa-C grid shared 
by ROMS and WRE at 9-km horizontal resolution extending from 3.6° N to 
66° N and from 99° E to 270° E to cover the entire North Pacific Ocean. WRF 
is divided into 30 vertical levels and its detailed configuration follows a recent 
study®. ROMS has 50 levels in a vertical terrain-following coordinate and includes 
a K-profile parameterization turbulent mixing closure scheme* for vertical mixing 
and a bi-harmonic horizontal Smagorinsky-like mixing for the momentum®. In 
CRCM simulations, the coupling strategy was similar to that of CESM except that 
WRFP’s flux formulations were used and WRF and ROMS were coupled hourly. 
The CRCM CTRL comprised an ensemble of five 6-month integrations, initialized 
on 1 October 2003, 2004, 2005, 2006 and 2007, respectively. Each of these initial 
conditions for ROMS was derived from a 6-year spin-up run, carried out using 
version 2 of the Coordinated Ocean-ice Reference Experiments (CORE-II)*! as 
surface forcing and 5-day averaged Simple Ocean Data Assimilation (SODA)** 
output as lateral boundary conditions. WRE initial conditions were derived 
from the NCEP-Department of Energy AMIP-II Reanalysis (NCEP-II)*”. The 
lateral boundary conditions for the CRCM CTRL ensemble were derived from 
6-hourly NCEP-II reanalysis for WRF and 5-day average SODA output for 
ROMS. The set-up of the MEFS ensemble was identical to CTRL except that the 
ROMS-simulated SST was processed by an embedded low-pass Loess filter with a 
15° (longitude) x 5° (latitude) cut-off wavelength** before being provided to WRF 
at each coupling step. 
Weakened KEJ in CESM MEFS. The weakened KE] in global CESM simulations 
when OME-A feedback is suppressed can be shown more clearly by a composite 
analysis of zonally averaged winter-season (ONDJFM) mean U over the Kuroshio 
Extension region. The composite was taken according to the KEJ axis defined 
by the latitude of maximum U in each winter. Extended Data Fig. 1 shows the 
11-winter-season composite of zonal-mean U from CTRL, MEFS and MEFS minus 
CTRL. The long-term averaged KEJ change shown in Extended Data Fig. la-c 
is remarkably similar to the first-year KEJ change shown in Fig. 2d-f and to the 
ensemble mean KEJ change simulated by regional CRCM shown in Fig. 2g-i. 
Influence of the large-scale atmospheric circulation. To further confirm that the 
rapid KEJ response is unlikely to be caused by large-scale atmospheric circulation 
changes, Extended Data Fig. 2a shows the annual mean large-scale wind stress curl 
averaged over the 11-year global CESM CTRL simulation, which is based on spatially 
low-pass filtered wind stresses using a 10° (longitude) x 10° (latitude) boxcar filter. 
As a part of the subtropical gyre in the North Pacific, the Kuroshio is driven by the 
large-scale negative wind stress curl generated by the tropical easterly trade winds 
and mid-latitude westerlies. In the global CESM MEFS when OME-A feedback is 
suppressed, this large-scale wind stress curl does not exhibit much change (Extended 
Data Fig. 2b). Similar results are found in the regional CRCM MEFS and CTRL. 
These findings further suggest that the substantial weakening of KEJ shown in the 
main text cannot be attributed to changes in large-scale atmospheric circulations and 
is more probably influenced by the change in the local OME-A feedback. 


Energy budget for EPE. In this study, we use the twin ensembles of short-term 
(6-month winter season) regional CRCM integrations, instead of the longer- 
term global CESM simulations, for the energetic analysis, mainly because of a 
programmatic reason: output of high-resolution (both in space and time) oceanic 
variables required for a detailed eddy energetic analysis is currently unavailable 
for the existing global simulations. Additionally, there are a number of scientific 
justifications for using the short-term regional CRCM simulations. First, the 
longer-term simulations allow planetary wave adjustment to affect the KEJ 
response to changes in large-scale atmospheric forcing, making it difficult to 
evaluate the role of local OME-A feedback in Kuroshio eddy energetics (although 
in our case we have demonstrated above that this is not a factor). Second, the 
global filtering adopted in the CESM MEFS can potentially affect tropical ocean 
circulations, which in turn can remotely influence Kuroshio eddy energetics, again 
complicating the interpretation of the OME-A feedback effect on Kuroshio eddy 
energetics. These problems are remedied by the regional modelling approach. 
However, the short integration period in this approach prevents an examination 
of equilibrium-state eddy energetics. Nevertheless, an analysis of transient-state 
eddy energetics can still be enlightening in understanding the role of OME-A 
feedback in maintaining oceanic eddies. 

In analysing the eddy energy budget, EPE is defined as the potential energy 
associated with zonally perturbed flows between 145° E-160° E. Sensitivity 
tests suggest that using different regions (for example, 150° E-165° E) does not 
substantially affect the EPE budget analysis. The governing equation for EPE* is: 
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where (...) denotes the horizontal area average, b is the buoyancy, up, = (u, v) is the 
horizontal velocity, w is the vertical velocity, Vi = (0/0x, 0/Oy) po= 1,025kgm 7 
is the water density, and N, is the buoyancy frequency associated with a predefined 
background potential density profile p, that is chosen to be the area average of the 
time-mean potential density (referenced to the sea surface) in the analysis domain 
(145° E-160° E, 30° N-42° N). Variables denoted by primes and bars correspond 
to the zonal perturbed and mean values, respectively. 

The term on the left-hand side of equation (1) is the time rate of change of EPE, 
which is determined by the production and destruction terms on the right-hand 
side of equation (1). EPE production comes from the two processes: advection of 
EPE (the first term on the right-hand side of equation (1)) and eddy buoyancy 
fluxes acting on the mean meridional buoyancy gradient (the second term on 
the right-hand side of equation (1)), the combined effect of which is primarily to 
convert MAPE to EPE and which is therefore referred to as baroclinic conversion 
(that is, MAPE-to-EPE conversion in the main text). Destruction of EPE is 
described by the third and fourth terms on the right-hand side of equation (1): the 
vertical eddy buoyancy flux that converts EPE to EKE to energize eddy circulation 
(that is, EPE-to-EKE conversion) and EPE dissipation (EPD) through either 
oceanic mixing or exporting eddy energy to the atmosphere, as depicted in the 
main text. These terms are computed using daily mean diagnostic output from 
the CRCM simulations and the equation is exactly balanced. 

The result of EPE analyses from the twin ensembles of regional simulations 
in the Kuroshio Extension region is summarized in Extended Data Fig. 3. In the 
CTRL ensemble with full OME-A feedback, it is evident that within the upper 
50m the main balance in the EPE budget is between the baroclinic conversion and 
EPE dissipation, both of which peak near the surface and decrease exponentially 
with depth. The EPE-to-EKE conversion is negligibly smaller near the surface, but 
becomes a dominant factor below 50m. The tendency term, although non-zero 
due to the short integration, in general makes a minor contribution to the EPE 
budget in the Kuroshio Extension region. Upon a vertical integration from the 
surface to 250m (inset tables in Extended Data Fig. 3), the CRCM CTRL ensemble 
reveals a 32.2mWm ~ baroclinic conversion, a 23.7 mW m2 EPE dissipation, a 
7.1mW m~ EPE-to-EKE conversion, and a 1.4mW m~” time rate of change, 
confirming that in the presence of OME-A feedback, most of the Kuroshio EPE, 
rather than being converted to EKE, is dissipated as illustrated in Fig. 4. 

Since OME-A feedback is expected to strongly affect EPE dissipation because 
of the close mesoscale THF-SST relationship shown in Fig. 1, it is no surprise that 
the EPE budget is greatly altered in the CRCM MEFS ensemble where OME-A 
feedback is suppressed. From Extended Data Fig. 3, it is abundantly clear that 
the EPE dissipation is reduced by approximately threefold, particularly near the 
surface, in MEFS compared to CTRL, confirming the role of OME-A feedback in 
removing EPE to the atmosphere. To keep the EPE budget balanced, the baroclinic 
conversion (production term) is also greatly reduced, albeit not as much as the EPE 
dissipation reduction. Interestingly, EPE-to-EKE conversion is even enhanced in 
MEFS, indicating that more EPE is converted to EKE in the absence of OME-A 
feedback. These changes are more conveniently summarized in terms of a vertically 
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integrated (from surface to 250m depth) EPE budget (inset tables in Extended 
Data Fig. 3). Compared to CTRL, the most important change in MEFS occurs in 
EPE dissipation, which is reduced by approximately 69% to 7.4mW m ? (from 
23.7mW m ”). EPE-to-EKE conversion increases by approximately 20% to 
8.5mW m ? (from 7.1mW m””); combining the EPE dissipation and the EPE- 
to-EKE conversion in MEFS gives rise to a total reduction of EPE destruction by 
approximately 48% to 15.9mW m ? (from 30.8 mW m ”). Baroclinic conversion 
reduces by approximately 43% to 18.3mW m * (from 32.2mW m ”). Finally, 
the rate of EPE change, though increased from 1.4mW m’*to2.4mWm”, 
is still minor compared to other terms. Therefore, in the absence of OME-A 
feedback, even though EPE production is reduced, there is actually more EPE 
being converted to EKE and less EPE being dissipated, supporting the proposed 
mechanism, illustrated in Fig. 4. The major reduction in EPE dissipation also 
causes an increase in EPE during the integration period, resulting in an enhanced 
rate of EPE change, although its contribution to the total EPE budget remains small. 
Since EPE production by baroclinic conversion is through down-gradient eddy 
buoyancy fluxes acting on the mean buoyancy gradient**“", its major reduction in 
the absence of OME-A feedback implies that there is a decrease in either the eddy 
buoyancy fluxes or mean buoyancy gradient. The former is an unlikely candidate 
because mesoscale eddy activity is energized, that is, both EPE and EKE increase, 
in MEFS compared to CTRL. It means that a weakened mean buoyancy gradient 
is a more likely consequence of OME-A feedback suppression in MEFS. Further 
analyses indicate that salinity makes a secondary contribution to the change 
of mean buoyancy gradient, and thus the decrease in baroclinic conversion is 
mainly due to the weakened potential temperature gradient. Extended Data Fig. 4 
shows a comparison of the meridional potential temperature gradient composite 
along the Kuroshio Extension in global and regional simulations. The composite 
was referenced to the KEJ axis in each winter, as in the composite analysis of 
zonal-mean U. As expected, the strongest temperature gradient collocates with the 
KE] owing to geostrophy. When OME-A feedback is suppressed, the temperature 
front is weakened by about 15% in MEFS compared to CTRL, corresponding to the 
broader and weaker KEJ shown in Fig. 2. The weakened temperature front and KEJ 
along with the reduced baroclinic conversion support the argument that less EPE 
needs to be extracted from MAPE to sustain the more energetic eddy field when 
OME-A feedback is suppressed, owing to the much reduced EPE dissipation and 
enhanced EPE-to-EKE conversion. 
Change of mean flow and eddy activity in the Gulf Stream. To examine whether 
similar changes occur in the Gulf Stream extension region, we also analysed 
change of mean flow and eddy activity in the Gulf Stream extension region in the 
global CESM CTRL and MEFS simulations. Similar to the KEJ response in the 
North Pacific, we found a substantial weakening (about 25%) of the Gulf Stream 
Extension current in global MEFS when OME-A feedback is suppressed (Extended 
Data Fig. 5a—c) and an increase of EKE and enstrophy (approximately 20%-40%) 
at mesoscale range (below about 500 km) in MEFS (Extended Data Fig. 5d, e). 
The consistent mean flow and eddy activity response to suppression of OME-A 
feedback in the Kuroshio Extension region and Gulf Stream Extension further 
supports our hypothesis that OME-A feedback may be essential in regulating 
western boundary current regimes. 
Horizontal wavenumber spectra of EKE and enstrophy. The horizontal 
wavenumber spectra of EKE and enstrophy in Fig. 3 and Extended Data Fig. 5 
are computed as 
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where (u, v) are the zonal and meridional current components, ¢ is the vertical 
component of relative vorticity, ky and @ are the magnitude and azimuth 
of the horizontal wave vector, and tilde represents the two-dimensional 
Fourier transform. Before computing the two-dimensional Fourier transform, a 
zonal mean is subtracted from daily mean (u, v) and ¢ to remove the background 
flow. 

Data. The 0.25° daily satellite altimetry sea surface height data—AVISO” 
(http://www.aviso.altimetry.fr/en/data/products/sea-surface-height-products.html) 
—are used to identify eddies in the Kuroshio Extension region combined with an 
eddy-tracking algorithm’. The European Centre for Medium-Range Weather 
Forecasts Year of Tropical Convection“ (http://apps.ecmwf.int/datasets/) product 
at T799 spectral truncation (approximately 25 km) and the NCEP Climate Forecast 
System Reanalysis*> (CFSR; http://cfs.ncep.noaa.gov/cfsr/downloads/) product at 
T385 spectral truncation (approximately 38 km) are used to derive the daily surface 
heat fluxes as shown in Fig. 1. The corresponding SST field is derived from 9-km 
daily satellite microwave-infrared SSTs (http://www.remss.com/measurements/ 
sea-surface-temperature). Before making the eddy composite in Fig. 1, all eddies 
are normalized by their radius and rotated so that the background wind direction 
is directed to the east, following a recent study”. 

Code availability. The Community Earth System Model (CESM) developed 
by the National Center for Atmospheric Research can be downloaded online 
(http://www2.cesm.ucar.edu/). The Coupled Regional Climate Model (CRCM) 
developed at Texas A&M University can be obtained by written request to 
R.M. (rmontuoro@tamu.edu). 
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Extended Data Figure 1 | Long-term mean Kuroshio response to 11-year CESM simulations in CTRL (a), MEFS (b) and MEFS minus CTRL 
OME-A feedback simulated by CESM. Composite of winter-season (c). The composite was taken according to the axis of the winter mean KEJ 
(ONDJEM) mean U velocity (colour scale and contours in metres per defined by the latitude of maximum U velocity. The x axis is the distance 


second) zonally averaged between 145° E and 160° E for the whole of the from the KEJ axis. 
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Extended Data Figure 2 | Large-scale wind stress and curl change 
simulated by CESM. Annual mean large-scale wind stresses (vectors) 
and wind stress curl (colour scale) in CTRL (a) and MEFS minus CTRL 
(b) averaged over the 11-year CESM simulations. The large-scale wind 


stresses are obtained using a 10° (longitude) x 10° (latitude) boxcar filter. 
The wind stress curl differences, significant with P value less than 0.05, 


based on monthly mean data using a Student's t-test are marked by red 
crosses in b. 
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Extended Data Figure 3 | EPE budget in the Kuroshio Extension region. 
a, Vertical profile of winter season (ONDJFM) mean EPE budget averaged 
over 145° E-160° E, 30° N-42° N for the CRCM CTRL ensemble (solid) 
and CRCM MEFS ensemble (dashed). b, As for a, except for MEFS minus 
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CTRL. The inset tables summarize the vertically integrated EPE budget in 
the upper 250 m for CTRL and MEFS in a and for MEFS minus CTRL in b. 
Each of the EPE budget terms is indicated using a different colour, and 
solid and dashed lines in a represent CTRL and MEFS, respectively. 
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Extended Data Figure 4 | Temperature gradient response in the (a), MEFS (b) and MEFS minus CTRL (c) based on 11-year CESM 
Kuroshio Extension region simulated by CESM and CRCM. simulations. The x axis is the distance from the KEJ axis at 0 defined by the 
a-c, Composite of winter-season mean meridional temperature gradient latitude of winter mean maximum U and the y axis is depth. d-f, As for 


(colour scale and contours in degrees Celsius per degree latitude) averaged —_a-c, but for the CRCM twin ensembles. 
between 145° E and 160° E across the Kuroshio Extension Front in CTRL 
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Extended Data Figure 5 | Mean flow and eddy response to OME-A 
feedback in the Gulf Stream simulated by CESM. a-c, Composite of 
winter season mean U velocity (colour scale and contours in metres 

per second) along 60° W based on 11-year CESM simulations. The 
composite was taken according to the axis of the winter mean Gulf Stream 
current along 60° W defined by the latitude of maximum U velocity. 
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The x axis is the distance from the Gulf Stream current. d, Ratio of EKE 
spectra between MEFS and CTRL in the Gulf Stream Extension region 
(35° N-42° N, 70° W-55° W) based on 11-year CESM simulations. 

e, As for d, but for eddy enstrophy (ENS) spectra. The grey shading 
indicates the geometric standard deviation of EKE and enstrophy ratio in 
individual years of the CESM simulations (d and e). 
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High-resolution seismic constraints on flow 
dynamics in the oceanic asthenosphere 


Pei-Ying Patty Lin'}, James B. Gaherty!, Ge Jin!, John A. Collins’, Daniel Lizarralde’, Rob. L. Evans? & Greg Hirth? 


Convective flow in the mantle and the motions of tectonic plates 
produce deformation of Earth’s interior, and the rock fabric 
produced by this deformation can be discerned using the anisotropy 
of the seismic wavespeed!~3. This deformation is commonly inferred 
close to lithospheric boundaries beneath the ocean in the uppermost 
mantle, including near seafloor-spreading centres as new plates are 
formed via corner flow’, and within a weak asthenosphere that 
lubricates large-scale plate-driven flow and accommodates smaller- 
scale convection™®. Seismic models of oceanic upper mantle differ as 
to the relative importance of these deformation processes: seafloor- 
spreading fabric is very strong just beneath the crust-mantle 
boundary (the Mohorovici¢ discontinuity, or Moho) at relatively 
local scales”, but at the global and ocean-basin scales, oceanic 
lithosphere typically appears weakly anisotropic when compared to 
the asthenosphere®’”. Here we use Rayleigh waves, recorded across 
an ocean-bottom seismograph array in the central Pacific Ocean 
(the NoMelt Experiment), to provide unique localized constraints 
on seismic anisotropy within the oceanic lithosphere-asthenosphere 
system in the middle of a plate. We find that azimuthal anisotropy is 
strongest within the high-seismic-velocity lid, with the fast direction 
coincident with seafloor spreading. A minimum in the magnitude 
of azimuthal anisotropy occurs within the middle of the seismic 
low-velocity zone, and then increases with depth below the weakest 
portion of the asthenosphere. At no depth does the fast direction 
correlate with the apparent plate motion. Our results suggest that 
the highest strain deformation in the shallow oceanic mantle occurs 
during corner flow at the ridge axis, and via pressure-driven or 
buoyancy-driven flow within the asthenosphere. Shear associated 
with motion of the plate over the underlying asthenosphere, if 
present, is weak compared to these other processes. 

The NoMelt experiment was designed to provide localized con- 
straints on the lithosphere-asthenosphere system in a normal oceanic 
tectonic plate, far from the influence of melting, either at the ridge or 
due to subsequent intraplate volcanism. NoMelt is centred on mature 
(70 million years old) Pacific sea floor southeast of Hawaii, between 
the Clarion and Clipperton fracture zones (Fig. 1). It has an aperture 
of 600km x 400km and consists of a broadband ocean-bottom seis- 
mograph (OBS) seismic array, a long-period magnetotelluric survey, 
and an active-source seismic refraction/reflection survey using a short- 
period OBS array. The year-long broadband OBS array recorded a large 
collection of shallow-source earthquakes from around the Pacific basin, 
and the vertical-component recordings of the Rayleigh surface waves 
from these events are exceptionally high quality within the period band 
20-160, once the effects of ocean infragravity waves and tilt noise are 
removed (see Methods) (Extended Data Fig. 1). By applying ambient- 
noise interferometry, we extend the Rayleigh-wave observations into a 
shorter-period (10-20s) band. 

Surface waves traverse NoMelt at a wide range of back-azimuths, pro- 
viding exceptional azimuthal coverage for estimating shear velocities 


and azimuthal anisotropy within the array footprint. We utilize a wide- 
band intra-array cross-correlation analysis to measure Rayleigh-wave 
phase velocities in the earthquake band (20-160s), and a spectral 
fitting procedure to estimate phase velocity in the ambient-noise band 
(10-20 s) (see Methods). We find minimal lateral variation in phase 
velocity within the array, and the observed phase velocities can be 
expressed by a single, average isotropic phase velocity curve for the 
entire array, plus a periodic cos(26) variation in phase velocity as a 
function of azimuthal direction 6 for each frequency (Fig. 2). Such 
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Figure 1 | Layout of NoMelt deployment on a bathymetric map of the 
central Pacific, with seafloor age (in millions of years) shown as white 
contours. Solid yellow circles represent broadband OBSs that produced 
good data; open and grey circles show locations of instruments that 

were unrecovered or produced poor data, respectively. Red circles and 
green triangles show locations of short-period OBSs and magnetotelluric 
instruments, for reference. Seafloor bathymetry was extracted from NOAA 
(https://www.ngdc.noaa.gov/mgg/global/global.html), and the seafloor 
age is available from EarthByte (http://www.earthbyte.org/Resources/ 
Agegrid/1997/digit_isochrons.html). 
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azimuthal anisotropy is diagnostic of deformation-induced olivine 
fabric in the mantle’, and we observe a very strong peak at ~80° azimuth 
at periods of 14-40s. The anisotropy appears to weaken at intermediate 
periods, reaching a minimum ata period of about 80s before strength- 
ening and rotating slightly towards a more east-west direction at the 
longest periods (Fig. 2 and insets in Fig. 3b and c). 

The average of all the phase velocity dispersion measurements can be 
inverted to estimate the isotropic one-dimensional vertically polarized 
shear-velocity structure (V;,) beneath the array. We utilize an iterative, 
linearized least-squares inversion algorithm, with an additional Monte 
Carlo scheme to evaluate uncertainty associated with choice of starting 
model. The best-fitting V,, profiles (Fig. 3a) are characterized by a thin 
high-velocity lid overlying a modest low-velocity zone, similar to other 
local and/or regional models of the Pacific!”!*. The absolute veloci- 
ties in both the lid and the low-velocity zone, as well as the transition 
between them (50-80 km depth), are very tightly constrained owing to 
the small uncertainty in the average phase velocity observations. The 
inferred seismic velocities are consistent with electrical conductivity 
profiles that suggest a dry lid overlying a modestly damp asthenosphere 
at subsolidus temperatures'*—no melt appears to be required to pro- 
duce either the low-velocity zone! or the high-conductivity zone", 
although very small amounts of melt cannot be ruled out. Regardless of 
melt content, the rheological profile across the system should correlate 
strongly with the shear-velocity profile, with a minimum in viscosity 
that coincides roughly with the middle of the low-velocity zone’®. 

We invert the frequency-dependent cos(26) variations in phase 
velocity for 20 models of azimuthal anisotropy (the G parameter) as 
a function of depth (Fig. 3b and c) using a smooth parameterization 
and a linearized least-squares algorithm (see Methods). The best- 
fitting models show the strongest anisotropy in the lithosphere with 
the fast direction parallel to the fossilized (dating to plate formation) 
seafloor-spreading direction (78°) estimated from the orientation of 
nearby fracture zones, and then decreasing in magnitude with depth 
into the asthenosphere. The weakest azimuthal anisotropy is centred 
in the asthenosphere at 100-150km depth, and then it increases to 
a secondary maximum at about 250 km depth. Fast directions rotate 
modestly from the fossil-spreading direction within the lithosphere to 
amore east-west direction at depth. The data provide clear evidence for 
this basic layered structure (insets to Fig. 3b and c): strong G anisotropy 
at the Moho is driven by the peak in azimuthal velocity variations near 
20-s period; the decay in G amplitude is required by the steep reduction 
in azimuthal variation in the period band 20-505; the secondary maxi- 
mum in G amplitude near 250 km depth is required by the persistence 
of peak-to-peak azimuthal variations of ~3% out to the longest period; 
and the rotation in G azimuth with depth in the mantle is required by 
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the rotation of the fast propagation direction from ~75° to ~90° across 
the period band 50-1405. The preferred models are not consistent with 
anisotropy in the asthenosphere aligned with apparent plate motion 
(APM) in a hotspot reference frame (Fig. 3c). 

The models of azimuthal anisotropy have several key implications 
for the nature of deformation and flow associated with oceanic tec- 
tonic plates. First, corner flow at the mid-ocean ridge represents the 
dominant fabric-forming process in the shallow oceanic mantle. This 
mechanism has long been recognized as important for explaining 
olivine fabric observed in ophiolites'’ and abyssal peridotites'® and 
the presumably related seismic anisotropy observed in the oceanic 
lithosphere’, particularly at the shallow depths constrained by active- 
source seismic refraction experiments”*!’. Our results suggest that 
this fabric extends down at least 50 km into the cooling plate, providing 
new constraints on models of ridge dynamics’. The strength of this 
anisotropy is noticeably greater than that associated with deforma- 
tion in the underlying asthenosphere. This high-resolution result is 
at odds with Pacific-wide studies of seismic anisotropy, which argue 
for a weakly anisotropic lithosphere underlain by a highly anisotropic 
asthenosphere*””, This difference probably reflects the limited ability of 
large-scale studies to resolve structure imparted by seafloor spreading”!, 
which exhibits lateral heterogeneity over short length scales, as 
compared to larger-scale processes such as apparent plate motion or 
density-driven flow”””?, 

Below the plate, our results suggest that the strongest deformation is 
induced by dynamic flow within the asthenosphere, rather than passive 
shear strain associated with motion of the plate over the underlying 
mantle. This conclusion is supported by the absence of an APM signal 
and by the pattern of anisotropic strength with depth: the fast aniso- 
tropic direction does not align with APM within any depth interval, 
and the anisotropy is weakest near the base of the plate, where APM- 
induced strain should accumulate, and peaks near 250 km depth, deep 
in the asthenosphere, consistent with deformation caused by dynamic 
flow. Although our results strongly favour asthenospheric flow as the 
source of the observed anisotropic signal beneath the plate, we cannot 
directly constrain the details of this flow. This is because the apparent 
fast directions correspond to differential shear between plate flow, 
flow within the asthenosphere, and flow within the subasthenospheric 
mantle*4—the former is known, but the latter two are not. This ambi- 
guity exists for the case of shear flow with length scales larger than the 
400-km array, and for deformation processes with length scales smaller 
than the NoMelt array, where the azimuthal signal will additionally 
include averages of local fabric variations. 

The lack of an APM signal in our high-resolution study contrasts 
sharply with most large-scale studies of the Pacific mantle?"!!”°. The 
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Figure 3 | Shear velocity and azimuthal anisotropy models in the 
upper mantle beneath NoMelt. a, The grey band depicts the span of 
shear velocity models resulting from inversion using 100 distinct starting 
models; the mean shear velocity model and associated standard deviation 
(s.d.) are shown by the solid red line and dashed black line, respectively. 
The blue line is a 52-110-million-year-old Pacific model’? as a reference. 
The inset shows the phase velocities (C, calculated on the basis of the 
inverted shear velocity models; grey lines) compared to the observations 
(blue circles with 1 s.d.). b, The modelled strength of azimuthal anisotropy 


difference between our inferred fast directions and that predicted by 
APM is robust and statistically significant (see Methods). The discrep- 
ancy between our result and previous models probably arises due to 
the averaging inherent in seismic tomography; the amplitude of the 
largest-scale structures is recovered, whereas the strength of smaller- 
scale structures is underestimated if they are poorly resolved. APM is 
coherent at the scale of the entire Pacific plate, and this may explain 
why it is so prominent in Pacific-scale tomographic models. In detail, 
a few plate-scale models show subtle non-APM variations in the 
NoMelt region’, and some authors prefer alternative flow processes to 
APM evenat the plate scale’*. Our result demonstrates that the smaller- 
scale processes of corner flow and asthenospheric dynamics are 
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Figure 4 | Schematic of proposed models for azimuthal anisotropy 
beneath the NoMelt experiment. a, b, Pressure-gradient-driven flow (a) 
and density-driven small-scale convection (b) in the asthenosphere. At 
the top is the surface view; black arrows show the direction of north, with 
circles denoting NoMelt broadband OBSs. The red arrows indicate the 
flow direction in the lithosphere-asthenosphere system. The lithosphere 
moves at plate velocity at all depths. The direction of dynamic flow in the 
asthenosphere is shown as roughly east-to-west, as obtained from our 
azimuthal anisotropy model. The anisotropy signals in the lithosphere 
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is shown by the solid red line with the grey shading showing +1s.d. from 
1,000 randomly generated resamplings of the observed measurements. 

The inset shows the predicted peak-to-peak amplitude (A; grey lines) 
compared to the observations (blue circles with error bars of 1s.d.). 

c, As for b but showing the direction of the azimuthal anisotropy. The 
vertical black and grey dashed lines indicate the orientation of fossil spread 
direction (FSD) and apparent plate motion (APM)*?. The inset shows the 
predicted fast direction (@; grey lines) compared to the observations 

(blue circles with error bars of 1s.d.). 


regionally more dominant than APM in controlling the development 
and evolution of seismic anisotropy in the mantle beneath the oceans. 

The pattern (or structure) of anisotropy with depth suggests two 
scenarios for the dominant geodynamic flow in the central Pacific 
asthenosphere. Both scenarios hypothesize a low-viscosity astheno- 
spheric channel*”®, with the viscosity within the channel described by 
a power-law rheology. Large-scale pressure gradients within the asthe- 
nosphere would induce Poiseuille flow with a plug-like velocity profile 
(Fig. 4a). In such flow, fabric-inducing strain gradients are localized 
near the top and bottom boundaries of the channel, with relatively little 
fabric produced in the centre of the system; this scenario nicely matches 
the anisotropy-strength profile observed beneath NoMelt. Anisotropy 


NoMelt 


Lithosphere 


Asthenosphere 


are frozen rock fabric (shown as brown segments) imparted by shear 
strain during corner flow at the ridge axis. Anisotropic strength in the 
lithosphere is largest at the top, decreases with depth, and is orientated 
with the FSD. The shear-strain gradient (maroon curve; Z, depth; 7, shear 
strain) resulting from dynamic flow is largest at the top and the base of the 
asthenosphere and smallest in the mid-asthenosphere for flow driven by 
either pressure gradients or density, producing rock fabric as indicated by 
the brown segments. 
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associated with the top boundary is difficult to distinguish from the 
lithospheric fabric, but the peak near 250 km depth would represent a 
shear zone at the lower boundary of the channel. Thermally induced 
pressure gradients within the asthenosphere may be associated with 
several geodynamic phenomena, including return flow from subduc- 
tion zone to mid-ocean ridge”, ridge-induced flow beneath the oceanic 
plates?”, and, more generally, density-driven flow2”?3, Assuming a static 
upper mantle beneath the asthenosphere, the apparent east-west fast 
direction of anisotropy in the lower asthenosphere is generally con- 
sistent with the first two scenarios. That is, ridge-driven flow would 
be approximately east-west, while return flow would be west-east. 
Published density-driven flow models do not predict east-west asthe- 
nospheric flow in the NoMelt region”’ so the observations presented 
here should prove useful in refining future models. 

Alternatively, density-driven small-scale convection within the 
low-viscosity channel could also produce fabric consistent with the 
anisotropy observations (Fig. 4b). In this scenario, horizontal flow at 
the top and bottom of convection cells produces strong azimuthal ani- 
sotropy for surface waves, while the centre of the channel exhibits weak 
and/or quasi-vertical fabric that results in little detectable azimuthal 
anisotropy, especially if the convection is on a length scale smaller than 
the array. Small-scale convection has long been invoked to explain 
bathymetry”’ and gravity observations” in the Pacific, most specifi- 
cally in the form of organized thermal instabilities (“Richter Rolls”?’) 
beneath the base of the plate. Direct observations of such structures 
in the asthenosphere have proved elusive*!, and our observation of 
east-west fabric is not consistent with the traditional Richter-Roll 
mechanism, which predicts horizontal fabric orthogonal to the APM- 
oriented long axis of the rolls. However, a recent high-resolution gravity 
compilation” reveals gravity lineations with orientations that vary over 
length scales of several hundred kilometres, perhaps indicating that 
small-scale convection beneath the Pacific is more three-dimensional 
than the traditionally invoked mechanism. Our observations may place 
direct constraints on the localized orientation of small-scale convection 
within the NoMelt region. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
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METHODS 


Tilt and compliance noise correction for OBS vertical-component data. OBS data 
often contain substantial noise sources that are not present in land-based seismic 
records. OBS vertical data are contaminated by compliance noise induced by 
infragravity waves that are also recorded on bottom-pressure data™, and tilt noise 
induced by bottom currents that leaks from horizontal components*. We calculate 
spectral transfer functions between pressure-vertical and horizontal-vertical com- 
ponent data, and apply this transfer function to the vertical components to remove 
the compliance and tilt noise. At this location and water depth (about 5,200 m), 
the coherent tilt noise is usually modest, and results in a subtle reduction in noise 
at periods longer than about 60s (Extended Data Fig. 1). The compliance signal is 
quite strong, but is limited to periods longer than about 90s, and after correction 
the noise levels of vertical data decrease substantially at long periods. Owing to 
possible spatial and temporal variation in noise characteristics, the transfer func- 
tions of each station are determined for each event individually**. To construct 
the transfer functions, we calculated the spectra and coherence of all pairs of 
vertical- and horizontal-component seismic data, and differential-pressure data, for 
two days before each event recording. The two-day-long seismogram was cut into 
40 overlapping time windows of length 6,000, and spectra, coherence, and transfer 
functions were calculated and applied as described by refs 1 and 2. 

Phase velocity analysis. At long periods (20-150), we estimate phase velocities 
within the array by applying the Automated Surface-Wave Measuring System*” 
on Rayleigh waveforms (Extended Data Fig. 2a) of 19 teleseismic earthquakes 
with moment magnitude over 6.5 (Extended Data Fig. 3). Frequency-dependent 
phase delays between pairs of nearby stations are measured using cross correlation; 
the phase delays are then inverted for phase velocities via the Eikonal equation*®. 
Lateral variations in phase velocities are small, and we utilize the dispersion in 
the centre of the array for each event to characterize average structure within the 
array. The measured phase velocities are strongly dependent on station-to-source 
azimuth, and at each frequency we solve for the azimuthally averaged phase velocity, 
plus the magnitude and direction of azimuthal variations of the form cos(26) 
+ cos(40), where @ represents azimuth from the centre of the array’. 

We apply the ambient-noise cross-correlation technique to obtain the phase 
velocities at short periods (10-20 s). The cross-correlation spectra are stacked to 
retrieve surface-wave Green’s functions between station pairs (Extended Data 
Fig. 2b), and the frequency-dependent phase velocity curve for each station-to- 
station path is estimated by fitting the cross-spectra‘’. These inter-station disper- 
sion curves are inverted for azimuthally averaged phase velocity plus cos(20) + 
cos(4@) variations at each frequency. While we include 4@ terms for the azimuthal 
variations in both ambient noise and earthquake observations, the amplitude of 
these terms is very small for all periods (0.09% to 0.4%), and the changes to the 
26 terms is indistinguishable. F-test calculations suggest that these estimates of 
azimuthal anisotropy do not substantially improve the fit to the data when com- 
pared to models including only 26 variation, so we utilize the 20-only estimates to 
invert for azimuthally anisotropic velocity structure. We use a bootstrap resampling 
algorithm to test the robustness and stability of the 20 estimates at each frequency; 
the error bars on the observations (Fig. 3 and Extended Data Fig. 4) conservatively 
include the formal errors of the misfit, as well as the potential variance inferred 
from the bootstrap analysis. 

Inversion for azimuthally anisotropic shear velocity. The azimuthally averaged 
phase velocity dispersion curve is used to estimate a shear-velocity profile as a 
function of depth via linearized least-squares inversion*!. We calculate a set of 
acceptable one-dimensional V,, models using a set of 100 starting models. The 20 
azimuthal dependence can be expressed as combinations of the elastic parameters 
of B, H and G (ref. 42), with the Rayleigh waves dominated by the parameters for 
G (ref. 43). In this study, we then modelled the cos(20) azimuthal variations in 
phase velocity by using combinations of the depth-dependent parameters G, and 
G,, where the subscripts c and s denote cosine and sine azimuthal dependence“. 
The amplitude and azimuth of G as a function of depth can be computed as 


{G2+G2 and 0.5tan“13:, respectively. 


The observed peak-to-peak amplitude A and fast direction ¢ as a function 
of period can be used to define the data parameters D, and D,, where the sub- 
scripts c and s denote cosine and sine fast-direction dependence: D-= Acos2¢ 
and D,= Asin2¢. 

The kernels that relate the data parameters D, and D, to model parameters and 
can be assumed to be the same down to a depth of 500km as L = pV2 (ref. 44). We 
use the average of the best-fitting V,, model determined from the average disper- 
sion curve for the kernel calculation in the azimuthal inversion. We invert for G, 
and G, with a smooth parameterization and a linearized least-squares 
algorithm that minimizes misfit of D. and D, (ref. 42). 

Forward calculations. We utilize forward modelling to explicitly test the robust- 
ness of three key characteristics of the inverted anisotropy models (Extended 
Data Fig. 4). First, we test for the necessity of the secondary peak in anisotropy 
deep in the asthenosphere. This feature is required to explain the persistent strong 
azimuthal signal observed in phase velocity at periods of 90s and longer. Squeezing 
tests suggest that this structure probably persists to at least 300 km; restricting it to 
smaller depth maintains a peak in amplitude near 200-250 km, with the amplitude 
of the peak increasing dramatically as the depth distribution is pushed to depths of 
<300km. Second, we test for the possibility of the smoother models for the 
strength and the direction of the anisotropy (orange solid lines) that specifically do 
not have a minimum of amplitude in the asthenosphere. Owing to the weakening 
in the observed strength of the peak-to-peak velocity variations between periods of 
30-60, the smoother model without a notch in amplitude near a depth of 120km 
does not fit the decaying pattern in G amplitude. Third, we evaluate whether a 
layer of asthenospheric anisotropy parallel to apparent plate motion (APM) might 
be unresolved by our surface-wave data. Such anisotropy is often inferred at the 
base of the plate”. We find that, if present, APM anisotropy must be weak (<1%) 
and within a layer of thickness <20km to satisfy the observed fast-direction obser- 
vations at periods of 40s and greater within the observational uncertainties. 
Code availability. NoMelt data has been archived under network code ZA at 
the IRIS Data Management Center (http://ds.iris.edu/gmap/ZA?timewindow= 
2011-2013). Software for array analysis of surface waves is available at https:// 
ds.iris.edu/ds/products/aswms/. 
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Extended Data Figure 1 | Power spectral densities (PSDs) of raw, tilt 
and compliance noise-removed seismograms. The PSD values are 
calculated from vertical data recorded by one NoMelt station, B13, located 
at 9.25°N, 145.55° W). The black curves show the spectrum of the raw 
vertical component. The red and blue curves show the spectrum after 
removing the tilt noise and compliance noise, respectively. The PSD is 
reported in decibels (dB) referenced to 10log/AAS (m~?s-4Hz~!)], where 
AAS is the acceleration-amplitude spectrum. The new high- and low-noise 
models of ref. 45, NHNM and NLNM (thin dashed lines), are shown as 
references. 
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Extended Data Figure 2 | Rayleigh waveforms recorded by NoMelt. a, Broadband vertical-component seismograms filtered in the 20-100-s period 
band from an event (green circle) in the southwestern Pacific (see inset map). The red inverted triangle denotes the NoMelt location. b, Ambient noise 
cross-correlation waveforms, filtered between 10s and 25s in the time domain. 
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Extended Data Figure 3 | Shallow-teleseismic event distribution. 19 
events (red stars) with high-quality Rayleigh waves were recorded during 
NoMelt one-year deployment. White lines represent the corresponding 
great-circle paths. Grey contours show the epicentral distances at 

10° intervals. 
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A somitic contribution to the apical ectodermal 
ridge is essential for fin formation 


Wouter Masselink!+, Nicholas J. Cole2, Fruzsina Fenyes!, Silke Berger!, Carmen Sonntag!, Alasdair Wood!, Phong D. Nguyen!, 
Naomi Cohen!, Franziska Knopf*+, Gilbert Weidinger*, Thomas E. Hall! & Peter D. Currie! 


The transition from fins to limbs was an important terrestrial 
adaptation, but how this crucial evolutionary shift arose 
developmentally is unknown. Current models focus on the distinct 
roles of the apical ectodermal ridge (AER) and the signalling 
molecules that it secretes during limb and fin outgrowth. In 
contrast to the limb AER, the AER of the fin rapidly transitions 
into the apical fold and in the process shuts off AER-derived signals 
that stimulate proliferation of the precursors of the appendicular 
skeleton’"!°. The differing fates of the AER during fish and tetrapod 
development have led to the speculation that fin-fold formation was 
one of the evolutionary hurdles to the AER-dependent expansion of 
the fin mesenchyme required to generate the increased appendicular 
structure evident within limbs'!. Consequently, a heterochronic 
shift in the AER-to-apical-fold transition has been postulated to be 
crucial for limb evolution". The ability to test this model has been 
hampered by a lack of understanding of the mechanisms controlling 
apical fold induction'!!*. Here we show that invasion by cells of a 
newly identified somite-derived lineage into the AER in zebrafish 
regulates apical fold induction. Ablation of these cells inhibits apical 
fold formation, prolongs AER activity and increases the amount 
of fin bud mesenchyme, suggesting that these cells could provide 
the timing mechanism proposed in Thorogood’s clock model of the 
fin-to-limb transition". We further demonstrate that apical-fold- 
inducing cells are progressively lost during gnathostome evolution; 
the absence of such cells within the tetrapod limb suggests that their 
loss may have been a necessary prelude to the attainment of limb- 
like structures in Devonian sarcopterygian fish. 

To understand the process of fin-fold formation in detail, we system- 
atically dissected how different cell types within the zebrafish fin arise. 
Muscle and vascular precursors have previously been demonstrated to 
migrate from the somites to the developing limb bud!*"'®. To define the 
role that somitic mesoderm contributions have during fin development, 
we undertook time-lapse analyses using transgenic lines that drive 
expression of green fluorescent protein (GFP) from pax3a regulatory 
sequences, as Pax3 expression marks migratory limb myoblasts in 
amniotes!’. We determined that somitic mesoderm cells migrate 
from somites 4—7 to generate myoblasts within the pectoral fin and 
hypaxial musculature (Fig. la—c, Supplementary Video 1). This analysis 
further identified a subset of the pax3a-GFP-positive somite cells 
within the fin mesenchyme that subsequently migrate and invade the 
AER (6.40.4 (mean + s.e.m.) cells per fin at 37h post-fertilization 
(hpf), n= 18). These cells cross the basement membrane that separates 
the fin mesenchyme from the AER (Fig. 1d-k, Supplementary Video 2), 
adopting a stereotypical elongated morphology distinct from the under- 
lying cuboidal mesenchymal cells (Fig. lh-k). Analyses of embryos 
stained for laminin, which marks the AER basement membrane 
(Extended Data Fig. la—h), as well as histological and transmission 


electron microscopy studies of the developing fin in the pax3a-GFP 
line (Fig. 1l-o, Extended Data Fig. 1m-p), confirm the position of 
these cells within the AER. These cells adopt the expression of epithelial 


TgBAC(pax3a:GFP) @ 23-33 hpf 


Pectoral fin Pectoral fin time lapse Time lapse 


Pectoral fin 


Figure 1 | A dual embryonic origin for zebrafish pectoral fin AER. 

a-c, Time-lapse analysis of TgBAC(pax3a:GFP) transgenic embryos 

from 23 to 33 hpf. Somite 4-6 contributes to the zebrafish pectoral fin 
mesenchyme (PFM), while somite 7 contributes to the posterior hypaxial 
muscle (PHM) (n= 6) (Supplementary Video 1). d-g, Time-lapse analysis 
of Tg(pax3a:GFP) transgenic embryos from 32-37 hpf. Cells move distally 
from within the muscle progenitor pool, invade the AER and take on an 
elongated morphology (1 = 4). Inserts in a and d show orientation of images 
that follow. h-k, Tg(pax3a:GFP); Tg(actc1b:mCherry) embryos at 36 hpf 
identify a subset of GFP-positive mCherry-negative cells with an elongated 
morphology localized distal to the AER basement membrane (arrowheads, 
hand k) (n= 18). j, k, Differential contrast images in which the AER 
basement membrane can be discerned (white arrowheads). 1, Toludine blue 
(TB)-stained section of a 44 hpf Tg(pax3a:GFP) embryo stained with an 
anti-GFP antibody (brown, diaminobenzidine (DAB), black arrowhead) 
shows the location of this cell distal to the basement membrane (white 
arrowheads). m, Adjacent section scanned with transmission electron 
microscopy (TEM) reveals that the electron dense DAB-stained cell (black 
arrowhead) lies distal to the basement membrane (white arrowheads). 

n, Merge of the two adjacent sections. 0, High-magnification TEM as 
outlined in m. Dashed lines approximate the location of the plasma 
membrane. Scale bars, 100j1m (a—c), 20j1m (d-n) and 3\1m (0). 


lAustralian Regenerative Medicine Institute, Level 1, 15 Innovation Walk, Monash University, Wellington Road Clayton, Victoria 3800, Australia. 7MND and Neurodegenerative Diseases Research 
Program, Faculty of Medicine and Heath Science, Macquarie University, New South Wales 2109, Australia. The Kennedy Institute of Rheumatology, University of Oxford, Roosevelt Drive, Oxford 
OX3 7FY, UK. “Institute of Biochemistry and Molecular Biology, Ulm University, Albert-Einstein-Allee 11, 89081 Ulm, Germany. 5EMBL Australia Melbourne Node, Level 1, Building 75, Monash 


University, Wellington Road, Clayton, Victoria 3800, Australia. 


+Present address: DFG- Center for Regenerative Therapies Dresden, Technische Universitat Dresden, FetscherstraBe 105, 01307 Dresden, Germany. 


542 | NATURE | VOL 535 | 28 JULY 2016 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


Tg(tbx6:gal4-vp16); Tg(UAS:Kaede) 


a Photoconvert Incubate 


Analysis 


Photoconvert 
somite 4 


Photoconversion @ I5 SS 


Tg(ubi:zebrabow) grafted into Tg(actc1b:GFP) 
f Donor Host 


Analysis 
Homotopic 
transplantation cg *© 
somite 4 a 
Graft Incubate 


Homotopic somite transplantation 


Tg(msgn1:CreERT2); Tg(ubi:zebrabow) 
n Before 4-OHT After 4-OHT 


Somite specific Analysis 
multi-colour iS 
Cre-mediated Ce= 
labelling 
Recombine — Incubate 


Merged 


SS map 


> a 


4-OHT @ shield stage 


Figure 2 | A specific lineage of somite 4 contributes to the AER 

in zebrafish. a—e, Photoconversion of somite 4 in embryos transgenic 
for Tg(tbx6:gal4-vp16) and Tg(UAS:Kaede) at the 15-somite stage (SS) 
(b), imaged at 2 dpf (c-e), reveals photoconverted Kaede (red) within the 
AER (n= 11), (schematised in a). c—e, A sub-set of the muscle progenitor 
pool (asterisks), presumptive vasculature (arrows) and cells in the AER 
are labelled (arrowheads). f-m, A Tg(ubi:zebrabow) somite (red, g, i) 
homotopically transplanted into a host embryo transgenic 

for Tg(actc1b:GFP), which marks muscle in green (h, i), identifies 

somite 4 as the source of somitic mesoderm-derived cells in the AER 

at 48 hpf (j-m) (schematised in f). n-r, Somite-specific “zebrabow 
lineage analysis (schematised in n). 4-OHT, 4-hydroxytamoxifen. 

o-r, Tg(msgn1:CreERT2); Tg(ubi:zebrabow) double-transgenic embryos 
demonstrate recombined zebrabow expression patterns in the pectoral 
fin AER at 36 hpf (arrowhead, n= 4). Dashed white lines indicate 
basement membrane. CFP, cyan fluorescent protein; RFP, red fluorescent 
protein; YFP, yellow fluorescent protein. Scale bars, 50 jm (b-e, j-m), 
100 um (g-i) and 201m (0-1). 


markers within the AER (Extended Data Fig. li-l), but do not at any 
stage express markers of other fin cell types known to be somitically 
derived such as muscle (n = 14, Fig. lh-k, Extended Data Fig. 2d-g) 
or vasculature (nm = 12, Extended Data Fig. 2a-c). 

These observations unexpectedly revealed that the AER, which has 
long been considered a tissue of pure ectodermal origin, has a dual 
mesodermal and ectodermal composition in zebrafish. To identify 
the specific somite that contributes cells to the AER, we undertook a 
series of distinct fate- mapping strategies. First, regional somite-specific 
photoconversions were performed using embryos transgenic for the 
somite-specific driver line Tg(tbx6:gal4-vp16) and Tg(UAS:Kaede), 
which confirmed somitic contributions to pectoral fin and hypaxial 
muscle (Extended Data Fig. 3a—h), the vasculature and the AER (n=7, 
Extended Data Fig. 3i-k). Sequential photoconversions of individual 
somites determined that somite 4 specifically provides a contribution 
to the AER (Fig. 2a—e, Extended Data Fig. 3l1-q). Homotopic somite 
transplantations subsequently confirmed that somite 4 has the 
unique capacity to contribute to the AER (Fig. 2f-m, Extended Data 
Fig. 4a—m, Supplementary Video 3). However, mechanical removal 
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of somite 4 did not lead to a loss of somite-derived cells in the AER 
(Extended Data Fig. 4n-r), suggesting flexibility in cell specification 
mechanisms not solely dependent on the somite of origin. Lastly, we 
undertook a ‘zebrabow’ lineage analysis using the somite-specific 
CreERT2 line, Tg(msgn1:creERT2)'® (Fig. 2n). Using this approach, we 
determined that a separate lineage of somite cells, distinct from those 
that generate fin muscle, contributes to the AER (n= 9, Fig. 20-1). 
Thus, three independent fate-mapping strategies—photoconversion, 
somite transplantation, and Cre/loxP lineage analyses—reveal the 
existence of a distinct somitic lineage that contributes to the zebrafish 
AER. 

Despite extensive fate-mapping analyses, a somitic contribution 
to the AER of tetrapods has not been described!*-1°°, neither has 
a somitic contribution to the AER of the caudal fin been observed’, 
together suggesting that somite-derived cells probably influence a 
process specific to the development of paired fins. The formation of 
the fin fold and the subsequent generation of the larval fin rays, or 
actinotrichia, within the fold are crucial for paired fin formation?!?3, 
Two pieces of evidence suggest that somite cells could be involved in 
both of these processes. First, somite cells contribute to the fin AER 
immediately before the AER-to-apical-fold (AF) transition (Extended 
Data Fig. la—h). Second, antibodies to distinct collagen isoforms that 
visualize actinotrichia development demonstrate that actinotrichia 
form directly proximal to the somite-derived cells within the AF 
(Fig. 3a—p, right insets). Furthermore, an antibody raised against colla- 
gen type I ala (Collala), which is known to be involved in actinotri- 
chia formation”, specifically localizes to the pax3a-GFP-positive cells 
in the AF (Fig. 3q-u, bottom insets), indicating that these cells are 
probably directly secreting collagen required for actinotrichia forma- 
tion. Collectively, these analyses suggest that the somite-derived cells 
present within the AF are actinotrichia-forming cells, a cell type previ- 
ously proposed to be required for pectoral fin-fold development”! but 
which remained unidentified until now. 

Next, ablation of somite-derived cells was performed at two different 
stages of AF formation using two distinct strategies. First, a genetically 
targeted cell ablation system was used to remove somite-derived 
cells in the AER immediately before the AF induction. Crossing the 
Tg(tbx6:gal4-vp16) line to the Tg(UAS-E1b:Eco.NfsB-mCherry)?™ 
strain results in the expression of the nitroreductase enzyme (NfsB) 
specifically within somite-derived cells, with the highest levels evident 
in the cells within the AER and exclusively maintained in these cells 
by 48 hpf (Fig. 4i, Extended Data Figs 3i-k, 5b). Specific ablation of 
somitic mesoderm-derived cells within the AER occurred after metro- 
nidazole (MTZ) addition (92% ablation, n = 25, Extended Data Fig. 5), 
which led to a severe disruption in AF formation (n = 11) anda lack 
of actinotrichia deposition (n =5, Fig. 4c, g), defects not evident in 
control conditions (Fig. 4a, b, e, f). Fin fold length in treated larvae 
was reduced to 38% of untreated controls at 2.5 days post-fertilization 
(dpf) (MTZ alone 66.0 + 1.5 1m, n =5; NfsB alone 69.8 + 1.0,um, n =5; 
MTZ*/NfsB* 31.8 +0.4|1m, n=5, Fig. 4d), increasing only to 59% 
of the length of controls at 4 dpf (MTZ alone 225.2 +3.7 1m, n=5; 
NfsB alone 237.0 + 2.2 um, n=5; MTZt/NfsB* 147.9 +0.8,m, n=5, 
Fig. 4h). Collectively, these analyses indicate that somite-derived AER 
cells are required for fin-fold induction and they have therefore been 
termed apical-fold-inducing cells (AFICs). 

The second approach involved the laser ablation of AFICs after 
migration into the AER. A single cell ablation method was established 
that used the property of the photoactivatable protein Kaede to absorb 
laser light efficiently at 405 nm” (Extended Data Fig. 6, Supplementary 
Video 4). Using this method, Kaede-positive cells within the AF of 
Tg(tbx6:gal4-vp16); Tg(UAS:Kaede) double-transgenic animals were 
ablated at 40 hpf, a time point immediately after both fin-fold induction 
and the initiation of actinotrichia formation (Fig. 3e-h, Extended 
Data Fig. la—h; cells remaining after ablation 0.4+ 0.24, n=5, 
Fig. 4j, Extended Data Fig. 7). By contrast, contralateral control fins 
possessed a normal number of Kaede-positive cells within the distal 
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Figure 3 | Somitic mesoderm-derived cells in the AER express 
actinotrichia-specific collagens. a—p, Transverse views of Tg(pax3a:GFP) 
pectoral fins stained for GFP (green), collagen 2 (Col2; red), and DAPI 
(blue). Pectoral fins at 36 hpf (a-d) (n = 7), 40 hpf (e-h) (n= 8), 44 hpf 

(i-l) (n=8), and 48 hpf (m-p) (n =6). Actinotrichia growth radiates 
directly from pax3a:GFP-positive cells within the AF at 44 hpf (j, 1) and 

48 hpf (n, p). High-magnification views of the boxed region of the extending 
fin fold are shown in the right panels of d, h, 1 and p. Arrowheads denote 


fin fold (9.2 + 0.37, n=5, Fig. 4i). Ablation of AFICs from the AF 
resulted in a notable reduction in the size of actinotrichia, which fail 
to undergo distal extension after ablation (Fig. 4j), a defect not evident 
within contralateral, non-ablated fins (Fig. 4i). Furthermore, the fin 
fold itself was reduced to nearly 30% of its overall length after ablation 
(contralateral 435.3 + 5.2 um, n=5; laser-ablated 315.1 +8.9 1m, 
n=5, Fig. 4k). By contrast, ablation of cells from the AER ectoderm, 
before the invasion of AFICs, had no effect on fin development (n=5, 
Extended Data Fig. 7w-zc), indicating that non-specific cell loss 
from the ectoderm was not responsible for the observed phenotype. 
Furthermore, in both of our ablation regimes the mesenchyme of the 
endoskeletal disc underwent a significant expansion (MTZ ablation; 
MTZ alone 208.2 + 1.9,4m, n=5; NfsB alone 214.141.7m, n=5; 
MTZ*/NfsB* 246.0 + 4.2 um, n=5; laser ablation; non-ablated 
contralateral fin 389.8 + 4.5 tm, n =5; laser-ablated fin 426.5 + 3.7 um, 
n=5, Fig. 4h, k). Collectively, these analyses reveal that ablation of 
AFICs disrupts both fin-fold induction and actinotrichia formation 
and leads to an expansion of the underlying fin mesenchyme. 

To test whether this increased proliferation is due to continued 
signalling from the AER to the underlying mesenchyme in ablated 
larvae, we assessed the gene expression patterns of fgf24 and fgf8, two 
major proliferative signals in the zebrafish fin!>°. fef24 undergoes a 
shift in expression from the underlying fin mesenchyme to the AF at 
the time of fin-fold formation2", a shift that fails to occur after MTZ 
treatment, suggesting a delay in AER maturation occurs within ablated 
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Col2 


DAPI + Collata 


pax3a:GFP-positive cells within the AF. q-u, pax3a:GFP-positive cells 
express Collala. Staining of Tg(pax3a:GFP) pectoral fins at 3 dpf for 

GFP (q), Collala (r), Col2 (s), DAPI and Collala (t) and merged images 
(u) (n=7). High-magnification views of q-u are shown in the bottom 
panels. Scale bars, 20|1m (a—p), 101m (d, h, |, p, high-magnification, right 
panels), 50j1m (q-u) and 251m (q-u, high-magnification, bottom panels). 
Dashed lines indicate the distal fin edge. 


larvae (Fig. 41, m, n= 12, 75% affected). fgf8 expression is specifically 
restricted to the AER until it is reduced during AF expansion (Fig. 4n). 
In ablated larvae, however, both the levels of expression and the region 
of the AER that expresses fgf8 expand markedly when compared to 
non-ablated controls (Fig. 4n, 0, n= 13, 77% affected). Furthermore, 
the mitotic index within the mesenchyme of ablated fins was more than 
double that exhibited by untreated controls (MTZ alone 0.143 + 0.004, 
n= 12; NfsB alone 0.148 + 0.004, n= 12; MTZ*/NfsB* 0.342 + 0.006, 
n= 12; Extended Data Fig. 8a—d), an increase inhibited by the addition 
of the Fgf signalling inhibitor SU5402 (Extended Data Fig. 8e). These 
results indicate that the increase in proliferation evident in ablated fins 
is Fgf dependent, a result concordant with embryological studies that 
prolonged fin AER signalling and inhibited AF induction through the 
serial removal and regeneration of the zebrafish AER”. 

To understand better the evolutionary importance of our 
findings, we examined the existence of AFICs during pectoral fin 
development within the embryos of species strategically positioned 
in the gnathostome phylogeny (Fig. 4p-w, Extended Data Fig. 9). 
Collala immunoreactivity reveals that cells analogous to the AFICs 
of zebrafish are present within the AER/AF of all species examined, 
although they vary in number and the timing of residency. Embryos of 
the basal gnathostome, the epaulette shark (Hemiscyllium ocellatum), 
exhibit a large number of Collala-expressing cells within the AER of 
their pectoral fins, which persist at all developmental stages exam- 
ined (Fig. 4p-r, Extended Data Fig. 9a-h). Embryos of the basal 
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Figure 4 | Somitic mesoderm-derived cells are required for fin-fold 
induction and actinotrichia formation, and are present within fin AER 
throughout the gnathostome phylogeny. a—h, Embryos transgenic for 
Tg(tbx6:gal4-vp16) with/without the Tg(UAS-E1b:Eco.NfsB-mCherry) 
transgene were incubated with/without MTZ. Staining for Col2 (red), 
pan-laminin (green), and DAPI (blue) were performed at 2.5 dpf (a-c) 
(n=5, each group) and 4 dpf (e-g) (1 =5, each group). Digital sections 
reveal MTZ-induced AFIC ablation results in a block of fin-fold formation 
(c, g, middle panels), schematically represented in an outline of fin-fold 
shape (c, g, right panels). NfsB~/MTZ* (a, e) and NfsBt/MTZ~ (b, f) 
controls do not reveal disruption in fin-fold formation. Fin fold length is 
reduced and mesenchyme expanded at 2.5 dpf (d) and 4 dpf (h). i, j, Laser- 
mediated ablation of Kaede-positive cells in the AF of Tg(thx6:gal4-vp 16); 
Tg(UAS:Kaede) double-transgenic embryos at 40-44 hpf analysed at 

5 dpf. Kaede-positive cells are absent in the fin fold of laser-ablated fins 

(j) (n= 5), whereas Kaede expression is persistent in contralateral non- 
ablated control fins (i) (n =5) (red). Actinotricha formation is disrupted 
and morphology is altered (dashed white lines) in laser-ablated fin folds 
(j) compared to the contralateral control (i). k, Quantification of proximal 
distal axis measurements in i and j, revealing a reduction in the size of the 
ablated fin fold, while the mesenchyme is increased in size. l-o, In-situ 
hybridization for fgf8 and fgf24 on Tg(thx6:gal4-vp16); Tg(UAS-E1b:Eco. 
NfsB-mCherry) ablated and non-ablated embryos. Dashed lines demarcate 


actinopterygian, the North American paddlefish (Polyodon spathula), 
possess Collala-expressing cells within the AER/AF at similar 
numbers and timing to that evident in zebrafish, but reduced in 
number in comparison to shark (Fig. 4s—u, Extended Data Fig. 9i-p). 
Embryos of the Australian lungfish (Neoceratodus forsteri), a 


Laser ablated 


mm Mesenchyme 
k Mm Fin fold 
* 


+e 


Length (um) 


the pectoral fin. At 44 hpf, fgf24 expression localizes to the AER of control 
animals (1) while in MTZ-treated animals expression remains in the 
underlying mesenchyme (m). At 60 hpf fgf8 expression is maintained 
throughout the AER of MTZ-treated animals (0) compared to control 
animals (n), where it becomes restricted to the distal tip of the fin fold. 
p-r, A fin of an embryo of the epaulette shark H. ocellatum at 65-mm stage 
(approximately S. canicula stage 30/31) stained for Collala (red), skeletal 
muscle (green) and DAPI (nuclear stain; blue). Collala-expressing cells 
are located throughout the AER at high density. q, High-magnification 
views of boxed region in p. r, Digital cross section of the dorsal half of 

the distal most fin edge at the level shown in p. s—u, A 3 dpf embryo of 

the basal actinopterygian P. spathula, transverse sectioned and stained for 
Collala expression (brown, DAB). t, u, High-magnification views of the 
boxed regions in s. Arrows in inset indicate Collala-expressing cells. 

v, w, N. forsteri (Australian lung fish) stage 47 embryo in transverse 
section and stained for Collala and counterstained with haematoxylin and 
eosin. A single Collala-expressing cell is evident in the most distal tip of 
the AER. Scale bars, 50|1m (a-c, l-o, r), 101m (a-c, high-magnification, 
middle panels), 100 1m (e-g, i, j), 25 um (e-g, high-magnification, middle 
panels; i, j, high-magnification, right panels), 400 1m (p) and 500 1m (s, v). 
*P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001 (analysis of variance 
(ANOVA) with Tukey’s post-hoc analysis (d, h), and unpaired Student’s 
two-tailed t-test (k)). NS, not significant; error bars show s.e.m. 


sarcopterygian fish species that phylogenetically abuts the tetrapod 
transition, possess the least number of Collala-expressing cells within 
the AER of their pectoral fins, which are lost as the fin-fold develops 
(Fig. 4v-w, Extended Data Fig. 9q-x). The results of sampling these key 
phylogenetic species, combined with known data on limb development 
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in tetrapods , are broadly consistent with a model that suggests 
a reduction in AFICs over evolutionary time would allow for a greater 
expansion of the mesenchyme evident in tetrapod species. Our results, 
therefore, provide a mechanistic basis to explain how a heterochronic 
shift in AER activity could have been achieved developmentally 
(Extended Data Fig. 10), and in doing so provides experimental support 
for Thorogood’s theoretical clock model of fin to limb transition!!. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Fish husbandry and handling. Adult zebrafish were maintained on a 14-h 
light/10-h dark cycle using standard procedures”’. Embryos were maintained at 
28°C in E3 medium. A breeding colony of epaulette sharks (H. ocellatum) was 
established from wild caught specimens in a purpose circulating marine system. 
Females lay fertilised eggs at a rate of approximately two paired eggs every 2 weeks, 
and these were removed from breeding tanks and maintained at 26°C in separate 
tanks until the desired developmental stage was determined by candling. Formal 
staging was in length (mm) and with reference to previously published shark 
developmental staging series”®. Lungfish (N. forsteri) and paddlefish (P. spathula) 
embryos were obtained and staged as previously described”. Experimental 
procedures were approved by the Monash Animal Services Animal Ethics 
committee, Monash University. 

Statistical analysis. No statistical methods were used to predetermine sample size. 
In each experiment, embryos were randomly selected, and experimenters were 
blinded to the treatment group prior to analysis. Statistical analysis was performed 
with a minimum of 5 independent biological replicates; exact numbers have been 
indicated at relevant locations in the manuscript. Statistical analysis was performed 
with Prism (GraphPad) using Student’s unpaired two-tailed t-test when comparing 
two conditions. Error bars represent s.e.m. Analysis of variance (ANOVA) with 
Tukey’s post-hoc analysis was used when comparing multiple samples. 
Zebrafish transgenic lines. Transgenic lines used in this work were: 
Tg(actc1b:GFP)“" (ref. 30), Tg(kdr1:mCherry)“> (ref. 31), Tg(UAS:Eco.NfsB- 
mCherry)? (ref. 32), TgBAC(pax3a:GFP)!™ (ref. 33), Tg(pax3a:GFP)P°, 
Tg(thx6:gal4-vp16)*4'? (ref. 34), Tg(UAS:Kaede)' (ref. 35), Tg(msgn1: 
Cre-ERT2)?® (ref. 36), Tg(ubi:zebrabow)*!*! (ref. 37), Tg(act1b:mCherry)?“* 
(ref. 29), and Tg(h2a f/z:nlsKikGR1). Tg(h2a f/z:nlsKikGR1) was generated by 
cloning a 2-kb region of the zebrafish histone 2a promoter** next to the nuclear 
humanized version of the photoconvertible protein KikGRI1 (ref. 39) (MBL 
Corporation). Transgenesis was performed using the Tol2 system”. 

Histology and microscopy. Histological and sectioning methods for haematox- 
ylin and eosin staining, Toludine blue staining, TEM, and periodic acid-Schiff 
staining were based on previously published methods*!. Confocal live imaging 
was performed on a Zeiss LSM5/710 meta or on a Leica SP5 using a x20 1.0 N/A 
water immersion objective. Non-confocal live images were captured on a Zeiss 
LumarV.12. Antibody-stained samples were imaged on a Leica SP8 using 
a x63 1.4 N/A oil objective. Electron micrographs of ultrathin sections were taken 
ona Nova Nano SEM electron microscope (FEI). 

Ablation, photoconversion, transplantation and lineage analysis strategies. 
Single-cell ablations were performed on a Zeiss LSM5/710 meta using a standard 
405-nm diode laser. Using the ROI tool in Zen black 2012 (Zeiss) a region of 
interest comprising a single Kaede-positive cell was ablated**. Single-cell ablations 
in control fins were performed in a similar fashion, by irradiation of individual 
Kikume-positive nuclei*’. The genetically targeted cell-ablation system used the 
property of the nitroreductase enzyme (Eco.NfsB) to convert the innocuous 
prodrug MTZ into a cytotoxic agent. To undertake ablation we crossed the 
Tg(tbx6:gal4-vp16) transgenic line to the Tg(UAS-E1b:Eco.NfsB-mCherry)?* 
strain, which results in the expression of the nitroreductase enzyme specifically 
within somite-derived cells. Embryos were incubated with a range of MTZ 
concentrations at 30 hpf, before fin fold induction. This analysis determined that 
20 mM MTZ was the optimal concentration in terms of larval survival (92%, 
n= 25). Nfsb/MTZ ablations were consequently performed using 20 mM MTZ 
by bathing the embryos from 30-48 hpf. Photoconversion was performed by illu- 
minating selected somites of a 15-somite-stage embryo using the 405-nm diode 
laser of the Zeiss LSM5/710 meta. Somite transplantations were performed as 
described previously”’. Pectoral fin measurements were performed by flat mount- 
ing the mechanically dissected pectoral fin. The fin bud mesenchyme was defined 
using the DAPI signal, which marks the nuclei of these cells. Fin fold length was 
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determined by measuring the distance from the proximal most location of the 
actinotrichia to the distal most tip of the pectoral fin. Somite extirpation was per- 
formed in a similar manner to somite transplantations, without grafting a somite. 
Cre/loxP lineage analysis was performed as described previously***”. Fef inhibition 
used $U5402 (Sigma) treatment as described previously”. 

Antibody staining and in-situ hybridization. Antibody staining was performed as 
previously described?’ using antibodies anti-Col2 (DSHB, II-II6B3), anti-Collala 
(Sigma, SAB2701691), anti-Coll (Abcam, AB23730), anti-pan-laminin (Sigma, 
19393), anti-Kaede (MBL International, PM012), anti-We3 (DSHB, deposited by 
R. A. Tassava), anti-RFP (Abcam, ab65856), anti-GFP (Life Technologies, A10262), 
and secondary Alexa Fluor-coupled antibodies (Invitrogen). In situ hybridization 
was conducted as described”. Probes used were fef8 (Addgene, fb73a06) and fgf24 
(ref. 25). 

EdU labelling. Embryos were pulsed with 2.5mM EdU (Sigma), 10% DMSO 
for 30 min on ice, followed by 1 h at 28°C. EdU was visualized according to 
manufacturer's instructions using Click-iT Chemistry. Mitotic index was determined 
by dividing the number of EdU-positive cells by the total number of cells. 
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Extended Data Figure 1 | Somitic mesoderm-derived cells migrate into 
the AER before AF formation and occupy a position distal to the AER 
basement membrane. a-h, Somite-derived cells are positioned distally 
to the basement membrane of the AER. Pectoral fin of Tg(pax3a:GFP) 
transgenic embryos at 36 hpf (a-d, lateral view) and 48 hpf (e-h, 
transverse view), stained for pan-laminin (red), which marks the 
basement membrane of the AER (a, e), GFP (green) (b, f), DAPI (blue) 
(c, g), and a merged view (d, h). Arrowheads mark the somite-derived 
cells that invade the AER, and occupy a position distal to the basement 
membrane. i-1, Somite-derived cells are positioned within the AER and 
adopt expression of an epithelial marker. Dorsal view of a pectoral fin of 
a Tg(pax3a:GFP) embryo at 48 hpf, stained with an antibody against We3 
(red), which is an epithelial marker for the AER and wound epithelium 


in salamander limbs (i), GFP (j), a merged view (k), and a merged 
differential contrast (DIC) image (1). Somite-derived cells (arrows) not 
only reside in the epithelial AER, but also co-express the epithelial marker. 
m-p, Histological analysis of the positioning of somite-derived cells 
within the AER and AF. Haematoxylin and eosin (H&E) staining (m, o) 
and periodic acid-Schiff (PAS) staining (n, p) on sagittal sections 

of Tg(pax3a:GFP) pectoral fins at 44 hpf (m, n) and 48 hpf (o, p). 

PAS specifically stains the basement membranes of tissues owing to the 
presence of Schiff-reactive glycoproteins within these structures. Arrows 
mark somitic mesoderm-derived cells in the AER (m, n) and AF 

(0, p), respectively. Both these histological analyses reveal that 
somite-derived cells are positioned distally to a continuous basement 
membrane. Scale bars, 20 1m. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Tg(pax3a:GFP); Tg(kdr1:mCherry) 
44 hpf 48 hpf 52 hpf 


mCherry 


48 hpf 


Extended Data Figure 2 | Distally located GFP-positive cells in to the mCherry-positive cells of the developing vasculature (d-g). 

the AER of Tg(pax3a:GFP) transgenic larvae do not contribute to Tg(pax3a:GFP)-expressing cells in the AER do not colocalize with 
either vasculature or musculature. a—c, Tg(pax3a:GFP) does not Tg(actclb:mCherry) expression (d). Dorsal and ventral muscle masses 
label vasculature in the zebrafish pectoral fin. Medio-lateral view of are defined by mCherry expression (e). GFP and mCherry colocalization 
Tg(pax3a:GFP); Tg(kdr1:mCherry) pectoral fins at 44 hpf (a), shown in white (f). Merged image (g). Arrows mark GFP-positive cells in 
48 hpf (b), and 52 hpf (c). Elongated GFP-positive cells localize distally the AER, while arrowheads mark the vasculature. Scale bars, 501m. 
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Extended Data Figure 3 | Validation of somite specific photoconversion 
of Tg(tbx6:gal4-vp16); Tg(UAS:Kaede) double-transgenic embryos. 
a-q, Photoconversion of somite 4-7 (a-k) and somite 4 (I-q). 

a, Schematic representation of photoconversion. b-d, Dorsal views 

after photoconversion of somite 4-7 at the 15 somite stage reveals non- 
photoconverted Kaede (green) (b) and photoconverted Kaede (red) (c). 
Merged image shown in d. Somites 1-7 are numbered. e-g, Lateral 
views of the same photoconverted embryo at 2 dpf, revealing non- 
photoconverted Kaede (e) and a photoconverted Kaede (f). Merged 
image shown in g. h, Close-up view of photoconverted somites from e-g 
including brightfield. i-k, Close-up view of pectoral fin shown in e-g 


- =) 


incubate analysis 


f aede.green|j 2 dpf 
Va _— ; 7 
Z A. r* 


showing non-photoconverted Kaede (i) and photoconverted Kaede (j). 
Merged image shown in k. Asterisks indicate presumptive fin muscle 
progenitors, arrows mark photoconverted Kaede-positive cells of the 
PHM (e-g) and presumptive vasculature (i-k), arrowheads indicate 
photoconverted Kaede-positive cells in the AER. I-n, Dorsal view after 
photoconversion of somite 4 at 15 somites, showing non-photoconverted 
Kaede (1), photoconverted Kaede (m), and merged image (n). Somites 
1-6 are numbered. o-q, Lateral view of the same photoconverted embryo 
at 2 dpf, showing non-photoconverted Kaede (0), photoconverted Kaede 
specifically in somite 4 (p), and a merged image (q). Somites 2-6 are 
numbered. Scale bars, 100|1m (b-g, i-q), 501m (h-k). 
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Extended Data Figure 4 | Validation of homotopic somite of somites 3-6. n—p, Dorsal view of Tg(actc1b:GFP); Tg(pax3a:GFP) after 


transplantations. a—d, Dorsal view of a somite 4 transplantation somite 4 has been removed at the 15-somite stage. Solid lines indicate 
within a 13-somite stage embryo that has a Tg(ubi:zebrabow) (red) somite borders, numbers indicate somite identity. q-r, Lateral view of 
somite homotopically transplanted into a host embryo transgenic for Tg(actc1b:GFP); Tg(pax3a:GFP) at 36 hpf. GFP-positive cells are present in 


Tg(actc1b:GFP), which marks the myotome in green. e-h, Dorsal view of the AER of both the control (q) and the experimental side (r) of the same 
the embryo in a-d at 24 hpf. i-1, Lateral view of the embryo in a-d at 2 dpf. embryo (n=5). Scale bars, 100 1m (a-i), 50 1m (n-p) and 201m (q, r). 
m, Table of somite transplantations. Results from homotopic transplants 
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a O mM Mtz (n=25) 20 mM Mtz (n=25) |40 mM Mtz (n=25) 
Survival Rate 100% 92% 84% 
Abnormal trunk muscle morphology 0% 4% 84% 
0% 


Tg(tbx6:gal4-vp16); Tg(UAS-E1b:Eco.NfsB-mCherry) 
O mM MTZ 20 mM MTZ 40 mM MTZ 


48 hpf 


Extended Data Figure 5 | Optimization of MTZ-induced ablations. stain for mCherry at 48 hpf reveals disrupted morphology of mesoderm- 
a-d, Tg(thx6:gal4-vp16); Tg(UAS-E1b:Eco.NfsB-mCherry) embryos derived cells in the AF. Medio-lateral view of pectoral fins at 48 hpf treated 
were treated with 0, 20 and 40mM MTZ from 30-48 hpf and analysed at with 0 mM MTZ (n=8) (b), 20mM MTZ (n=7) (c), and 40mM MTZ 

48 hpf. a, Initial analyses assessed muscle transgene expression patterns, (n= 11) (d). Arrows indicate dead or dying cells based on cell morphology 
embryonic survival and overall phenotype severity as well as the efficiency and fragmentation, dashed lines indicate the border between the AF and 
of ablation of mCherry-expressing cells within the AF. b-d, Antibody underlying mesenchyme. Scale bars, 50 jum. 
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Time (sec) 
Extended Data Figure 6 | Optimization of the laser-ablation strategy. Fluorescent intensity of the ROI before (scaled to 1, the initial fluorescence 
a-l, Assessment of fluorescent recovery after laser ablation of Kaede- of the targeted cells) and after laser-mediated cell ablation. A very small 
positive cells in the AER of double-transgenic Tg(thx6:gal4-vp 16); amount of fluorescent recovery can be observed at 20 (c, m), 50 (f, m) and 
Tg(UAS:Kaede) embryos by scanning the region of interest (ROI) for 100 (i, m) scan iterations, while 200 scan iterations results in a consistent 
20 (a-c), 50 (d-f), 100 (g-i) and 200 (j-I) laser pulse iterations. m, and permanent loss of Kaede fluorescence (1, m). Scale bars, 10|1m. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


ablation time series 


-1 sec 


laser induced blebbing 


lsec d 


Cell specific laser ablation 


m » 
~ 


before after 


| laser induced ablation in Talia f/z:nlsKikGr1) 


[| before | after 


EennEateral acer ablated ablated | = coo 


DAPI Collagen2 


MI Mesenchyme 
Gl Fin Fold 


ns. 


ns. 


| 

Oo 

° ae 
Ne 
o 

cz) 
lee 
=|" 
rz) 
(tee 
Oo 

cz) 
olf 
o 

co) 


Extended Data Figure 7 | Validation of the laser-ablation strategy. 
a-h, Morphological disruption after laser ablation of Kaede-positive 
cells in the AER of Tg(thx6:gal4-vp16); Tg( UAS:Kaede) double- 
transgenic embryos. Arrows mark the ablated cell while the adjacent 
cell, indicated by arrowheads, is not affected. i-l, Membrane blebbing, as 
visualized by photoconverted Kaede (red), can be observed in the AER of 
Tg(tbx6:gal4-vp16); Tg(UAS:Kaede) double-transgenic embryos. Before 
ablation the cell is clearly labelled by non-photoconverted Kaede (green) 
(i), whereas after ablation, blebbing is observed at selected time points 
(arrows) (j-l). m-v, Proof-of-concept experiment in which each Kaede- 
positive cell in the AER is sequentially ablated from Tg(thx6:gal4-vp16); 
Tg(UAS:Kaede) double-transgenic embryos. Arrows mark ablated cells. 
Ablation is cell-specific, and does not occur in non-targeted Kaede- 
positive cells in the AER or the underlying mesenchyme. w-zc, Random 


ablation of AER cells before AFIC invasion does not affect fin formation. 
Using Tg(h2a f/z:nlsKikGR1) transgenic embryos, in which all nuclei are 
labelled by the photoconvertible protein Kikume, it is possible to ablate 
cells at 32 hpf within the AER before AFIC invasion. Arrows indicate 
targeted cells within the AER at 2s before ablation (w) and after ablation at 
2s (x), 12s (y) and 22s (z). za-zc, In each experimental condition, 6 cells 
were ablated per AER. No differences in actinotrichia formation or overall 
morphology can be detected when comparing contralateral control (za) 
with laser-ablated (zb) conditions. Detailed quantification of fin fold and 
mesenchyme length does not reveal any significant differences between 
contralateral control and laser-ablated conditions (zc). P values are by 
unpaired Student's t-test. Scale bars, 10 1m (a-I), 50,1m (m-v), 10}1m 
(w-z) and 100j1m (za-zc). Error bars in zc represent s.e.m. 
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Extended Data Figure 8 | Ablation of somite-derived cells from the 
AER results in increased proliferation of the underlying mesenchyme. 
a-d, Ablation of somite-derived cells in the AER in embryos transgenic 
for Tg(tbx6:gal4-vp16) and Tg(UAS:Kaede) results in an increased mitotic 
index of the underlying mesenchyme as measured by EdU pulse labelling. 
All samples were analysed at 48 hpf. The endoskeletal disk is undergoing 
condensation in controls (arrowheads) (a, b), while this is not evident 

in MTZ-treated animals (arrow) (c). d, Quantification of EdU staining 
observed in the mesenchyme of pectoral fins after MTZ-induced ablation 


ad 
hb 


mitotic index 


of somite-derived cells from the AER. e, Inhibition of Fgf signalling 
reduced the mitotic index in pectoral fins that have undergone MTZ- 
mediated ablation of somite-derived cells in the AER. The mitotic index of 
control fins is significantly lower compared to MTZ-treated pectoral fins. 
However, after treatment with the Fgf inhibitor $U5402, both MTZ-treated 
and untreated pectoral fins are reduced to a similar mitotic index. Scale 
bars, 401m. P values are by ANOVA with Tukey’s post-hoc analysis. Error 
bars in d and e represent s.e.m. 
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Extended Data Figure 9 | Progressive reduction of collagen secreting 
cells from the AER of different species in the gnathostome phylogeny. 
a-h, Embryos of the epaulette shark H. ocellatum stained for Collala 
(red), skeletal muscle (green) and nuclei (blue). a, b, At 36 mm 
(approximately S. canicula stage 26/27; ref. 32), Collal-expressing cells 
are present at low numbers within the AER initially, with cells evident 
traversing the AER basement membrane from the fin mesenchyme 
(arrows in b, b’). The density of Collala-expressing cells within the AER 
increases notably with the extension of the fin, and the cells are present 
in large numbers in 48 mm embryos (c, d), (approximately S. canicula 
stage 30/31; ref. 28). Collala-expressing cells are maintained at the edge 
of the fin fold at advanced stages of fin development (85 mm, e, f) and 
cluster into discrete foci at the edge of the fin fold by 90 mm (g, h). a, ¢, e, 
g, Dorsal views proximal to the top, inserts are high-magnification views 
of the boxed regions showing only Collala expression in red in each fin. 
b, d, f, h, Maximum projection confocal renders of the distal edge of the 
AER/AF region of each fin in a, c, e and g, respectively. b’, d’, f’, h’, Sagittal 
views of the rendered images in b, d, f and h, respectively, revealing the 
persistent localization of Collala-expressing cells at the distal most edge 


3 


of each fin (arrows). i-p, Embryos of the paddlefish P spathula, sectioned 
in the transverse plane and stained for Collala expression (brown, DAB) 
at 1 dpf, (i, j), 2 dpf (k, 1) and 4 dpf (m, n). Colonization of the AER by 
Collala-expressing cells follows a very similar pattern temporally and 
spatially to that evident in the zebrafish fin, but appear at a lower density 
than in the epaulette shark. 0, p, Control immunohistochemistry on 1-dpf 
sections with no primary antibody. j, 1, n, p, High-magnification views of 
the boxed regions in i, k, m and 0, respectively. Arrows in j, land n mark 
Collala-expressing cells in the AER. q-x, N. forsteri embryos in transverse 
section and stained for Collala and counterstained with H&E. Very few 
Collala-expressing cells colonize the AER initially (q, r, stage 42-44), and 
they continue to be present in low numbers up to stage 48 (s, t) and are 

no longer evident at fin extension and AF formation stages (u, v, stage 50). 
w, x, Control immunohistochemistry on stage-48 sections with no primary 
antibody. r, t, v, x, High-magnification views of the boxed regions in q, s, 

u and w, respectively. Arrows in inset images in r and t mark the limited 
number of Collala-expressing cells in the AER of lungfish embryos. Scale 
bars, 200 1m (a, c), 501m (b, f), 100 ,1m (d, h), 400 um (e) and 500 pm 

(g, i, k, m, 0, q, s, u, w). 
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Extended Data Figure 10 | Schematic representation of pectoral fin 
development and its relation to the fin-to-limb transition. 

a, Induction of fin fold formation and subsequent actinotrichia formation 
are dependent on somitic mesoderm-derived apical fold inducing 

cells (AFICs) cells infiltrating the AER. After the induction of fin bud 
formation (1), a somite-derived lineage (green) will infiltrate the AER 
(yellow, 2). Subsequent fin fold (red) induction (3) is dependent on this 
process, as is the formation of actinotrichia (dashed lines, 3 and 4), which 
rely on the secretion of collagen directly from these cells (green, 3 and 4). 
b, Throughout evolution of the gnathostome lineage, the cells that regulate 
fin fold induction have been gradually lost as a necessary prerequisite for 
limb formation. Schematic representation of fin development including 
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the transition from AER (yellow) to AF (red). The timing of somite- 
derived cells (green bars) infiltrating the AER coincides with the AER to 
AF transition. Relative numbers of somite-derived cells are represented by 
the height of the green bars. In cartilaginous fish, cells infiltrate the AER 
early and abundantly to remain in the fin fold throughout development. In 
ray-finned fish, infiltration into the AER is delayed and the number of cells 
infiltrating the AER is also reduced, but infiltration continues throughout 
development. The number of cells infiltrating the AER and their timing of 
residency is further reduced in lobe-finned fish, while finally in tetrapods 
there is an absence of somite-derived cells in the AER, and a lack of AF 
induction. 
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Indifference to dissonance in native Amazonians 
reveals cultural variation in music perception 


Josh H. McDermott!, Alan F. Schultz’, Eduardo A. Undurraga** & Ricardo A. Godoy* 


Music is present in every culture, but the degree to which it is shaped 
by biology remains debated. One widely discussed phenomenon is 
that some combinations of notes are perceived by Westerners as 
pleasant, or consonant, whereas others are perceived as unpleasant, 
or dissonant!. The contrast between consonance and dissonance is 
central to Western music””, and its origins have fascinated scholars 
since the ancient Greeks*"!°. Aesthetic responses to consonance are 
commonly assumed by scientists to have biological roots!!"!4, and 
thus to be universally present in humans!*!°. Ethnomusicologists!” 
and composers’®, in contrast, have argued that consonance is a 
creation of Western musical culture®. The issue has remained 
unresolved, partly because little is known about the extent of cross- 
cultural variation in consonance preferences!*, Here we report 
experiments with the Tsimane’—a native Amazonian society 
with minimal exposure to Western culture—and comparison 
populations in Bolivia and the United States that varied in exposure 
to Western music. Participants rated the pleasantness of sounds. 
Despite exhibiting Western-like discrimination abilities and 
Western-like aesthetic responses to familiar sounds and acoustic 
roughness, the Tsimane’ rated consonant and dissonant chords and 
vocal harmonies as equally pleasant. By contrast, Bolivian city- and 
town-dwellers exhibited significant preferences for consonance, 
albeit to a lesser degree than US residents. The results indicate 
that consonance preferences can be absent in cultures sufficiently 
isolated from Western music, and are thus unlikely to reflect innate 
biases or exposure to harmonic natural sounds. The observed 
variation in preferences is presumably determined by exposure to 
musical harmony, suggesting that culture has a dominant role in 
shaping aesthetic responses to music. 

We conducted two studies to measure consonance preferences in 
populations with varying exposure to Western music. In Study 1, we 
measured preferences for sounds in residents of the United States and 
compared them to three populations in Bolivia: (1) residents of the 
capital city (La Paz); (2) residents of a rural town (San Borja); and 
(3) members of a native society of horticulturalist-foragers 
(the Tsimane’) in a remote village (Santa Maria) in the Amazon rainforest. 
City- and town-dwellers were fluent in Spanish and generally had tele- 
visions and radios. By contrast, the Tsimane’ were mostly monolingual 
in their own language, lacked televisions, and had limited access to 
music via radio’’. The Tsimane village lacked electricity and tap water, 
was inaccessible by road, and could be reached only by canoe (Fig. 1a). 
Their contact with Western culture was mainly limited to occasional 
trips to nearby towns. To compare differences between cultures to 
intra-cultural variation within Westerners, we tested two groups from 
the United States—one with at least two years of experience playing 
a musical instrument, and one with at most one year of experience. 

The Tsimane’ were of particular interest because harmony, polyph- 
ony, and group performances are by all accounts absent from their 
music. This conclusion was suggested by previous recordings and 


documentation” and was substantiated by interviews with Tsimane 
musicians (see Methods for a description of Tsimane’ music, and 
Supplementary Audio 1 and 2 for examples of Tsimane’ songs). The 
apparent absence of harmony raised the question of whether they 
would exhibit aesthetic responses to consonance and dissonance 
despite not having prior exposure to them. 

Participants were presented with sounds over headphones (Fig. 1b) 
and rated the pleasantness of sounds on a four-point scale. We observed 
pronounced differences between groups in the ratings of consonant 
and dissonant chords (Fig. 2a, b). US residents showed strong conso- 
nance preferences, as expected from previous studies”', with musicians 
showing stronger preferences than non-musicians (synthetic chords: 
F(46,1) =7.44, P= 0.009; sung chords: F(46,1) = 12.97, P=0.001). 
Moreover, Bolivian city- and town-dwellers displayed more modest 
but statistically significant preferences for consonance (P< 0.01 in all 
cases, t-tests). By contrast, the Tsimane’ rated consonant and dissonant 
chords as equally pleasant. This finding held for chords composed of 
synthetic tones as well as chords composed of recorded notes sung by a 
vocalist, producing an interaction in both cases between stimulus class 
and participant group (synthetic: F(124,4) = 48.43, P< 10°; sung: 
F(126,4) = 26.79, P< 10~'°). Tsimane’ ratings varied somewhat across 
chords (synthetic: F(9,270) = 1.96, P= 0.04; sung: F(9,288) = 1.93, 
P=0.05), with some tendency for larger pitch intervals to be preferred 
over smaller ones (Extended Data Fig. 1), but without consistent dif- 
ferences between consonant and dissonant chords. 

Similar results were obtained when participants rated single tones 
whose frequencies were manipulated to be harmonic (multiples of a 
common fundamental frequency) or inharmonic, a distinction believed 
to differentiate consonant from dissonant chords*!”!%.212? (Fig. 2c). 
US residents showed a preference for harmonic over inharmonic 
tones, but the effect was stronger in musicians than in non-musicians 
(F(46,2) = 16.12, P=0.0002), was attenuated in Bolivian city-dwellers, 
and was undetectable in town-dwellers and the Tsimane; again 
producing an interaction between stimulus and participant group 
(F(157,4) =22.49, P< 10713). 

To rule out the possibility that participant groups might have varied 
in their understanding of task instructions, or in their ability to assign 
pleasantness ratings to sounds, we asked them to rate recordings of 
emotional vocalizations” (Fig. 2d). In this case all groups showed a 
preference for laughter over gasps (F(157,1) =200.48, P< 1 x 10~”°), 
with no interaction with group (F(157,4) = 1.09, P=0.36). Thus, for 
familiar sounds expected to have a positive or negative association, the 
Tsimane exhibited a preference in the expected direction, comparable 
to that of the other groups. 

To test whether the Tsimane’ would generally fail to exhibit prefer- 
ences for unfamiliar sounds, we obtained ratings for synthetic tones 
that varied in roughness, another quality known to affect aesthetic 
responses in Westerners®””!~*° (Fig. 2e). Roughness was manipulated 
by presenting pairs of frequencies to the same or different ears. In 
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Figure 1 | Location and setup of experiments. a, Map of the region 

from which Bolivian participants were drawn. Participants resided in 

the Bolivian capital city La Paz, the rural town of San Borja, the Tsimane’ 
village of Santa Maria (Study 1), or Tsimane’ villages around San Borja 
(Study 2; not labelled with names to minimize clutter). Town symbols are 
approximately proportional in size to town population. Colour of territory 
denotes elevation. b, Sounds were presented over closed headphones via 
laptop (charged with a gasoline generator when needed). For all but the 
discrimination experiment (Fig. 4), participants provided a pleasantness 
rating (with a four-point scale) following each sound. 


contrast to the results for chords, and for harmonic and inharmonic 
tones, all groups exhibited a significant preference for smooth over 
rough tones (F(157,1) =85.51, P< 107'°), with no interaction with 
group (F(157,4) = 2.04, P=0.09), even though for US participants 
this preference was the smallest of those measured. 

We replicated and extended these results in Study 2, testing a differ- 
ent group of Tsimane listeners and a comparison group of musically 
experienced listeners in the United States. We first tested whether the 
absence of a consonance preference would extend to harmony in real- 
istic musical material. We recorded Tsimane’ vocalists singing Tsimane’ 
song phrases several times, and then pitch-shifted and superimposed 
the phrases to create harmonies in conventionally consonant or disso- 
nant intervals (see Supplementary Audio 3-10). We obtained pleasant- 
ness ratings for these harmonies from US and Tsimane listeners as in 
Study 1. For replication purposes, we also conducted analogous experi- 
ments with sung and synthetic two-note intervals, and with sung triads. 

The results with chords replicated those of Study 1 (Fig. 3a—c): in 
every case, US listeners preferred consonance to dissonance, while 
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Figure 2 | Results of Study 1. a-e, Average pleasantness ratings of sounds 
from five experiments across five populations: 23 US musicians (US-M), 
25 US non-musicians (US-NM), 24 Bolivian city-dwellers (Capital), 

26 Bolivian town-dwellers, and 64 Tsimane’. Each experiment featured 
sounds from two classes expected to differ in pleasantness for US listeners. 
Chord notes were either synthetic (resembling a piano), or recorded from 
a trained singer. Chords were conventionally consonant (major third, 
perfect fourth, perfect fifth, and major triad) or conventionally dissonant 
(minor second, major second, tritone, major seventh, and augmented 
triad). Vocalizations were recordings of human laughs and gasps. Synthetic 
tones varied in harmonicity or roughness. Asterisks denote statistical 
significance: *P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001; n.s., 
not significant (two-tailed t-tests, uncorrected for multiple comparisons 
because they were conducted post-hoc, following analysis of variances 
(ANOVAs) to test for main effects and interactions). Data are mean and s.e.m. 


Tsimane listeners did not, producing interactions between stimulus 
and participant group (synthetic intervals: F(72,1) =42.4, P< 107°; 
sung intervals: F(95,1) = 39.4, P< 1077; sung triads: F(95,1) =17.9, 
P<10~“). Ratings of individual chords by Tsimane listeners again 
varied across both synthetic and sung intervals (Extended Data Fig. 2), 
but were primarily explained by interval size, with higher ratings for 
larger intervals, unlike US listeners (significant correlations between 
rating and interval size for Tsimane; synthetic: r=0.94, P< 107%; sung: 
r=0.81, P=0.001; but not for US listeners; synthetic: r= 0.07, P=0.82; 
sung: r= 0.23, P=0.48). 

Notably, similar results were obtained with harmonies generated 
from Tsimane songs (Fig. 3d). Even though the music was foreign to US 
participants, they reliably judged consonant renditions as more pleasant 
than dissonant (t(46) =6.2, P< 10~°), whereas the Tsimane’ did not 
(t(49) = 1.2, P=0.22; stimulus x group interaction: F(95,1) = 30.2, 
P<10~°). Moreover, Tsimane listeners reliably preferred some of the 
song excerpts used to generate harmonies over others (7(25) = 49.01, 
P=0.003; Extended Data Fig. 3). The materials thus elicited consistent 
aesthetic responses in the Tsimane, but these were not driven by con- 
sonance and dissonance. As in Study 1, when presented with recorded 
vocalizations (Fig. 3e), both Tsimane’ and US listeners showed prefer- 
ences for laughter over gasps (F(95,1) =129.4, P< 107'%; no interac- 
tion with participant group: F(95,1) = 1.8, P=0.18), indicating that 
Tsimane’ listeners could readily perform the task. These results again 
suggest that the preference for consonance is absent in the Tsimane’. 

To explore the effects of harmonicity and roughness found in Study 1, 
we measured pleasantness ratings for pairs of pure tones (single 
frequencies) separated by intervals from the chromatic scale (0-8 
semitones)! (Fig. 3f). This range includes some consonant intervals, 
for which the tone frequencies approximate harmonics of a common 
fundamental (and are thus related by simple integer ratios), and some 
dissonant intervals, for which the tone frequencies are inharmonic. 
Headphones were used to present the two tones to both ears (diotic 
presentation) or to separate ears (dichotic presentation), as in the 
smooth/rough tone experiment from Study 1. Diotic presentation 
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Figure 3 | Results of preference experiments from Study 2. ad, Average 
pleasantness ratings of consonant and dissonant musical stimuli by 47 US 
musicians (US-M) and 50 Tsimane. Two-note chords were intervals from 
the chromatic scale. Triads were major or augmented. Vocal harmonies 
(intervals from the chromatic scale) were generated from recordings of 
Tsimane’ vocalists. e, Average pleasantness ratings of human laughs and 
gasps. Asterisks denote statistical significance: ****P < 0.0001 (two-tailed 
t-tests) (a—e). f, Average pleasantness ratings of concurrent pure tones 
(single frequencies)! presented diotically (both tones to both ears) or 
dichotically (each tone to a different ear). Blue and pink asterisks indicate 
ratings significantly different from that of the unison (P < 0.05; Wilcoxon 
signed rank tests, two-tailed, uncorrected for multiple comparisons). Black 
curves plot the difference between ratings of dichotic and diotic stimuli. 
Black asterisks denote dichotic-diotic differences significantly different 
from zero (P< 0.05; Wilcoxon signed rank tests, two-tailed, uncorrected). 
Data are mean and s.e.m. 


produces roughness (via beating) when the tones are close in frequency, 
whereas dichotic presentation minimizes beating while preserving the 
musical interval (ratio) between frequencies. The stimuli thus dissoci- 
ated roughness and harmonicity. 

Among US residents, pleasantness ratings dipped at dissonant inter- 
vals (the minor second, major second, and tritone) for both diotic and 
dichotic presentation. The variation in pleasantness for dichotic inter- 
vals (y7(7) =62.7, P< 10 !°) implicates factors other than roughness 
in listeners’ aesthetic judgments, potentially that of harmonic frequency 
relations. However, for small intervals (<3 semitones), diotic ratings 
were nonetheless lower than dichotic ratings, evident in the difference 
curve, which varied significantly with interval ( \7(7) = 20.6, P=0.004; 
significant differences from zero only for 1 and 2 semitones, P < 0.05). 
This effect is the signature of an aversion to roughness, which is audible 
only for small intervals (for which frequencies interact via cochlear 
filtering)>7!. 

Tsimane’ ratings also suggested an aversion to roughness: diotic 
intervals were again rated lower than dichotic only for small inter- 
vals, producing significant variation in the diotic-dichotic difference 
(\7(7) = 29.25, P= 0.0001; significant differences again only for 1 
and 2 semitones, P < 0.05). By contrast, the influence of harmonic- 
ity was absent. Ratings of dichotic intervals did not vary with interval 
(x7(7) =9.52, P=0.22), and the dip at the tritone was not present for 
either presentation mode. These results provide further evidence that 
an aesthetic aversion to roughness is present in the Tsimane’ but that 
the aesthetic response to consonance (and simple integer frequency 
ratios) is not. This dissociation substantiates the conclusion that the 
Western notion of dissonance is distinct from acoustic roughness, but 
closely related to inharmonicity?!”. 

The Tsimane indifference to consonance and harmonicity raises the 
question of whether they can hear the underlying stimulus distinction. 


To test their ability to discern harmonic from inharmonic frequency 
relations, we conducted a discrimination experiment in which listen- 
ers judged whether a stimulus contained one or two sounds (Fig. 4a). 
Listeners heard the first six harmonics of a synthetic tone. On half of 
the trials, the even harmonics were mistuned or presented earlier in 
time. Both manipulations used an established sound segregation cue to 
cause the even and odd harmonics to be heard as distinct sounds”*?’. 
In particular, the mistuning disrupted harmonicity, causing the 
even harmonics to have a different fundamental frequency than the 
odd harmonics, producing the perception of a dissonant interval in 
Western listeners. 

US residents reliably identified the intended number of tones in 
both conditions, as expected (Fig. 4b, left: (46) = 22.9, P< 10~°; 
t(46) =54.4, P< 10~‘”). Tsimane’ listeners were worse overall at the 
task, but were well above chance in both cases (Fig. 4b, right: t(48) = 7.3, 
P< 107%; t(48) =11.8, P< 107), indicating sensitivity to harmonicity. 
Analysis of subsets of Tsimane’ participants revealed that even those 
who performed best on this task (mean sensitivity index (@’) of 2.0 for 
the mistuned condition) remained indifferent to dissonance, disso- 
ciating perceptual sensitivity and preference (Extended Data Fig. 4). 

Taken together, the results suggest that the Tsimane’ are able to make 
pleasantness judgments about sounds. Moreover, for familiar sounds 
associated with a valence (laughs and gasps), and for synthetic sounds 
varying in roughness, their judgments resemble those of Westerners. 
In addition, they are able to hear the acoustic distinctions associated 
with consonance and dissonance, discriminating harmonic from inhar- 
monic frequencies. The absence of a measurable Tsimane’ preference 
for consonance over dissonance thus appears to specifically reflect 
differences in their aesthetic response to this contrast. 

The cross-cultural variation we observed suggests that consonance 
preferences are unlikely to be innate, and that they are not driven by 
exposure to harmonic natural sounds such as vocalizations!*. Instead, 
consonance preferences seem to depend on exposure to particular types 
of music, presumably those that feature consonant harmony. Incidental 
exposure to Western music could explain the modest preferences 
evident in Bolivian city- and town-dwellers, along with those previously 
reported in human infants”*~”. The lack of harmony in Tsimane’ music 
leaves open the question of what preferences might be observed in indi- 
viduals exposed to conventionally dissonant intervals, as are prevalent 
in some non-Western music!”. But Western music is presumably not 
unique in fostering preferences for some tone combinations over others. 

The belief that consonance is biologically determined has been fuelled 
by observations that consonance and dissonance differ in acoustic 
properties such as harmonicity and roughness**”*!??12, and as 
such are not arbitrary categories. However, we found preferences for 
harmonic tones, for simple integer frequency ratios, and for conso- 
nant chords/harmonies to all co-vary with presumptive exposure to 


28 JULY 2016 | VOL 535 | NATURE | 549 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Western culture. Thus, rather than being an inevitable consequence 
of auditory system biology, it seems that the preferences exhibited by 
Western listeners for harmonic frequencies arise from exposure to 
Western music because the musical structures prevalent therein tend to 
produce harmonic frequencies”. And although an aversion to rough- 
ness is apparently present cross-culturally, it seems to be unrelated to 
consonance and dissonance, perhaps because musical sounds are in 
practice not very rough”!”. The role of biology in consonance could 
therefore be limited to constraining what is discriminable, potentially 
rendering some aesthetic contrasts more readily acquired than others. 

The roles of culture and biology in consonance and dissonance have 
remained unresolved in part because cross-cultural experimental data 
are rare. Studies of culturally isolated populations are increasingly dif- 
ficult to conduct due to the diffusion of Western culture around the 
world, but our results underscore their importance for revealing the 
diversity of human musical behaviour. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Participants (Study 1). US musicians. The first group of US residents (n = 23, 
19 female; mean age = 20.2 years, s.d. = 2.4 years) was selected to have experience 
playing a musical instrument (mean number of years spent playing an instru- 
ment =7.7 years, s.d.=4.9 years, range = 2-18 years). None were professional 
musicians, but all reported having taken music lessons or classes for much of the 
time that they were musically active (mean number of years during which lessons 
were taken = 6.8 years, s.d.= 4.3 years). All were residents of New York City. Two 
participants had immigrated to the United States as young children (at the ages of 
2 and 4 years old); the rest were born and raised in the United States. 

US non-musicians. The second group of US residents (n= 25, 12 female; mean 
age = 37.9 years, s.d.= 15.1 years) was selected to have minimal experience play- 
ing or studying music (mean number of years spent playing an instrument or 
singing = 0.2 years, s.d.=0.3 years, range = 0-1 years). 17 out of 25 (68%) had 
never played an instrument or sung in an organized setting, and most had never 
taken music lessons or classes (mean number of years during which lessons were 
taken=0.1 years, s.d.=0.3 years). All were residents of the Boston metropolitan 
area and had been born and raised in the United States. No attempt was made to 
achieve demographic diversity among US participants because numerous simi- 
lar experiments had previously been conducted in a variety of populations from 
Western societies*!??!-33, and the effects are known to generalize across age, gen- 
der, and geographical region of North America. However, the non-musician US 
sample was similar in demographic composition and musical experience to each 
of the Bolivian populations tested. 

La Paz (capital). Participants (n= 24, 14 female; mean age = 29.9 years, s.d.=9.3 
years) had been born and raised in La Paz, the capital of Bolivia. Only three 
reported any experience playing music (across the group: mean number of years 
spent playing an instrument = 0.3 years, s.d. =0.8 years), and only five reported 
experience singing (across the group: mean number of years spent singing = 0.5 
years, s.d.=1.1 years). 

San Borja (town). Participants (n = 26, 15 female; mean age = 31.6 years, s.d. = 12.9 
years) had resided for most of their life in San Borja, a small town in the depart- 
ment of Beni in the Amazon basin. San Borja could be reached by car during dry 
months of the year from other towns in Bolivia, but was accessible only by plane 
during much of the rainy season owing to land slides and flooding. Most of the 
townspeople tested (17 out of 26) reported having experience singing, but only 
8 out of 26 had experience playing a musical instrument. 

Tsimane’. Participants (n = 64, 31 female; mean age = 31.5 years, s.d. = 10.2 years) 
lived in a small rainforest village (Santa Maria) without electricity or tap water. 
Santa Maria could only be reached by canoeing up the Maniqui River (an indi- 
rect tributary of the Amazon). Age data are approximate, as many Tsimane’ do 
not know their precise age in years*!. Results of Fig. 2 were similar (all statistical 
comparisons produced the same result) if we limited the analysis to participants 
between the ages of 16 and 35, who were more likely to know their exact age 
(n=41, mean age of 25.0 years, s.d.=5.8 years). Music performance among the 
Tsimane’ was largely limited to a small number of expert practitioners who would 
perform traditional songs at social gatherings; most Tsimane’ participants did not 
perform music themselves. 

Sample sizes for the United States were chosen based on previous knowledge of 

effect sizes and variability”! Sample sizes for Bolivia were chosen to approximately 
match the US groups (capital and town) or to be as large as possible given practical 
constraints (Tsimane’). See Extended Data Table 1 for a summary of the participant 
group demographics for Study 1. 
Chord stimuli and analysis (Study 1). Chord stimuli were taken from a previous 
study”). Pleasantness ratings were obtained for 10 chord types: the minor second, 
major second, tritone, major seventh, and augmented triad (conventionally dis- 
sonant in Western music), and the major third, perfect fourth, perfect fifth, and 
major triad (conventionally consonant in Western music), along with the unison 
(conventionally neutral in Western music). The consonant and dissonant chords 
had been found previously to yield high and low pleasantness ratings, respectively, 
in Western listeners”). All chords were composed of intervals from the equal- 
tempered scale. 

Synthetic chords were composed of synthetic notes: complex tones containing 
the first 10 harmonics (in sine phase). Harmonic amplitudes were attenuated at 
14 dB per octave to mimic naturally occurring musical note spectra. Synthetic 
chords were given temporal envelopes that were the product of a half-Hanning 
window (10 ms) and a decaying exponential (decay constant of 2.5s~!) that was 
truncated at 2s. 

Sung chords were composed of notes (using the vowel /u/; ‘ooh’) produced and 
recorded by a professional singer (A. Lehr). They were high-pass filtered with a 
cutoff frequency of 150 Hz (fourth order Butterworth filter) to eliminate handling 
noise from the microphone. The temporal envelope of the onset of the sung notes 
was not altered apart from applying a half-Hanning window (10 ms). A linear ramp 
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was applied over the last half of each note to fade them smoothly down to zero, such 
that the duration was the same as the synthetic notes. Example stimuli (taken from 
ref. 21) are available online: http://mcdermottlab.mit.edu/consonance_examples/ 
index.html. 

Each chord was presented four times, each time with a different root note, 
drawn from C#4, D#4, F4, and G4 for the synthetic chords and G#3, A#4, B#4, 
and D4 for the sung chords, where C4 denotes middle C (the range was lower 
for the sung chords to accommodate the range of the singer). There were thus a 
total of 40 chords presented for each of the synthetic or sung chord experiments. 
Sung and synthetic chords were presented in separate blocks. Tsimane’ participants 
completed either the sung (n = 33) or the synthetic (n= 31) chord block due to 
time constraints. All other participants completed both blocks. 

Ratings were averaged across consonant chords and dissonant chords to yield 
a single average rating for each aesthetic class for each participant. Apart from the 
analysis in Extended Data Fig. 1, all statistics were performed on these averaged 
ratings. The ratings of the unison were omitted from the main analysis to simplify 
data presentation (to two stimulus conditions per experiment). 
Harmonic/inharmonic synthetic tone stimuli (Study 1). The harmonic and 
inharmonic tone stimuli were taken from a previous study”!. The harmonic tones 
contained a subset of the frequencies of a normal harmonic tone. There were three 
types of harmonic tone: one containing harmonics 1, 2, 4 and 8, one containing 
harmonics 1, 2, 3, 5 and 9, and one containing just the first harmonic. There were 
also three types of inharmonic tone. Two of them were modifications of the second 
harmonic stimulus. In the first type of inharmonic tone (jittered), the harmonic 
frequencies were perturbed up (even components) or down (odd components) by 
0.5 semitones. In the second type of inharmonic tone (shifted), all the harmonic 
frequencies were increased by 30 Hz. To avoid differences in beating between the 
harmonic and inharmonic tones, alternate frequency components were presented 
to opposite ears (the wide spacing between components additionally minimized 
beating). The third type of inharmonic tone consisted of two pure tones sepa- 
rated by 1.5 semitones, presented dichotically to minimize beating. The resulting 
stimulus was inharmonic, while being roughly matched in frequency to the 
single-frequency harmonic stimulus, but without introducing salient differences 
in beating. 

Inharmonicity detection thresholds for a single mistuned frequency compo- 
nent in a harmonic complex have been previously estimated to be about 1% of 
the harmonic’s frequency”®. Our inharmonic perturbations were well above this: 
~3% for the jittered condition, between ~1.5-10%, depending on the harmonic, 
for the shifted condition, and 9% for the dichotic pure tone stimulus. Each of 
the perturbations therefore produced audible inharmonicity, which to a typical 
Western listener is perceived as unpleasant”'. 

All harmonic and inharmonic stimuli had components whose amplitudes 
decreased by 14 dB per octave to resemble naturally occurring sounds. All tones 
were given temporal envelopes that were the product of a half-Hanning window 
(10 ms) and a decaying exponential (decay constant of 2.5 s!) truncated at 2s. 

Each type of tone was presented four times, each time with a different base 
frequency, drawn without replacement from the 4-semitone range above middle C. 
Smooth/rough synthetic tone stimuli and analysis (Study 1). The smooth and 
rough tone stimuli were taken from a previous study”, Tones that were perceptu- 
ally rough or smooth were generated by presenting pairs of single frequencies to 
either the same or different ears (diotic and dichotic presentation, respectively). 
Diotic presentation of two similar but non-identical frequencies is known to pro- 
duce the ‘rough’ sensation of beats, typically considered unpleasant by Western 
listeners>”!5. In contrast, dichotic presentation of two such frequencies greatly 
attenuates perceived beats*°, but leaves the spectrum (and its pitch) unchanged 
relative to the diotic version*®. The frequencies composing each stimulus were 
separated by either 0.75 or 1.5 semitones (1.5 for the low- and mid-frequency 
ranges, and 0.75 for the high-frequency range, to produce beat frequencies with 
prominent roughness), such that considerable beating was heard when presented 
diotically>”". These stimuli, like the harmonic/inharmonic synthetic tones, were 
given envelopes that were the product of a half-Hanning window (10 ms) and a 
decaying exponential (decay constant of 2.5s~') truncated at 2s. 

We generated smooth/rough tones in three frequency ranges. Each type of tone 
was presented four times, each time with a different base frequency, drawn without 
replacement from the 4-semitone range above either middle C (low-frequency), 
one octave above middle C (medium-frequency), or two octaves above middle C 
(high-frequency). The low-frequency dichotic tone also served as an inharmonic 
stimulus that could be compared to the single frequency harmonic stimulus. 

The harmonic/inharmonic and smooth/rough tones were presented in random 
order in a single block. There were four exemplars of each of the eleven stimulus 
types, for a total of 44 trials. To analyse the harmonic/inharmonic tone data, the 
ratings of the three harmonic stimuli were averaged together, as were the ratings 
of the three inharmonic stimuli (one of which was the low-frequency dichotic 
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frequency pair stimulus). All statistics were performed on these two average rat- 
ings. To analyse the smooth/rough tone data, the ratings of the three dichotic 
(smooth) stimulus types were averaged together, as were the three diotic (rough) 
stimulus types. All statistics were performed on these two average ratings. 
Vocalization stimuli (Study 1). Vocalizations were a subset of the Montreal 
Affective Vocalization Set®’, which consists of vocalizations produced by actors in 
a laboratory setting. Five vocalizations were selected from each of the categories of 
laughter, gasps, and crying, which had previously been found to be rated as pleasant 
(laughter) and unpleasant (gasps and cries) by North American listeners*”. The 
stimuli were presented in random order in a single block of 15 trials. The crying 
sounds gave similar results to the gasps (low ratings by all groups) and their ratings 
were omitted from the analysis to simplify the data presentation (to two stimulus 
conditions per experiment). 

Stimulus presentation (Study 1). All stimuli were normalized to have the same 
rms level. Stimuli were played out by a laptop computer over closed headphones 
(Sennheiser HD 280Pro). The volume was set to a level that was determined in 
pilot experiments to be comfortable (approximately 70 dB SPL), and that was fixed 
across participants. 

Experimental protocol (Study 1). Sound stimuli for a given experiment were 
presented sequentially in random order. After each stimulus was presented, par- 
ticipants gave their rating verbally (participants could respond “like it a lot’, “like 
it a little? “dislike it a little’, or “dislike it a lot”), and the experimenter recorded the 
response on a data entry sheet. To avoid biasing participants’ responses or data 
entry, experimenters were always blind to the stimulus being presented, and stimuli 
were audible only to the participant. 

The experiments were completed in a single sitting along with several other 
experiments that are not reported here. Experiment order was counterbalanced 
across the participants in each group. Experiments were conducted in June-July 
2011 (Tsimane’), January-February 2012 (San Borja and US musicians), July 2012 
(La Paz), and March-April 2014 (US non-musicians). 

Both studies were approved by the Tsimane’ Council (the governing body 
of the Tsimane’ in the Maniqui basin, where the studies took place), as well as 
the Committee on the Use of Humans as Experimental Subjects at MIT and the 
Committee for Protection of Human Subjects at Brandeis. Experiments were con- 
ducted with the informed consent of the participants. 

Participants (Study 2). US musicians. A group of musically experienced US res- 
idents was tested to provide baseline preference measures. They were selected to 
have experience playing a musical instrument (n= 47, 32 female; mean age = 25.9 
years, s.d.=7.5 years; mean number of years spent playing an instrument = 8.4 
years, s.d.=6.5 years, range = 1-25 years). None of the participants were profes- 
sional musicians; we label them here as ‘musicians’ primarily to distinguish them 
from the non-musicians of Study 1 who were selected to have almost no experi- 
ence playing music or singing. All were residents of the Boston metropolitan area 
and had been born and raised in the United States. All participants completed all 
experiments in the study. 

Tsimane’. Participants (n = 50, 27 female; mean age = 27.4 years, s.d.=9.8 years; 
as in Study 1, ages are approximate) lived in five small villages surrounding San 
Borja (Arenales, Manguito, Pachual, Limoncito, and Tacuaral). Unlike the village 
of Santa Maria from Study 1, the villages of Study 2 could be reached by automobile 
during the dry season and had begun to be equipped with electricity at the time of 
testing (some families were connected to the electrical grid). The choice of villages 
was dictated by accessibility given the local weather conditions during the study 
(roads were often muddy and impassable). We tested a larger group of participants 
and only analysed data from the subset who did not own a radio, to minimize the 
chance that they had significant exposure to Western music. Only 27 of these 50 
participants completed the synthetic interval experiment (it was substituted for 
another experiment half-way through our fieldwork). One of the 50 participants 
did not complete the discrimination experiment. The size of the Tsimane’ group 
was as large as possible given the practical constraints of the study (the US group 
was chosen to approximately match it in size). See Extended Data Table 2 fora 
summary of the participant group demographics for Study 2. 

Chord stimuli (Study 2). Chord stimuli were generated as in Study 1. The synthetic 
and sung two-note chord experiments presented intervals from 0 to 11 semitones 
at each of 4 different root-pitches, yielding 48 trials in total. The sung triad exper- 
iment presented the major, minor, diminished, and augmented triads each at 4 
different root-note pitches, yielding 16 trials in total (in the main analysis only the 
major and augmented triads were analysed, as they produced the largest contrast 
in US listeners). The root notes for the synthetic chords were the first four notes 
above middle C. For both sung chord experiments, the root notes were the first 
four notes above G#3 to accommodate the vocal range of the singer. The notes in 
the sung chord experiments were sung vowels (“aah” for two of the exemplars and 
“ooh” for the other two, randomly assigned to the root notes). Within experiments, 
trials were randomly ordered. 


Vocal harmony stimuli (Study 2). Because the Tsimane’ generally do not sing 
in groups and were unable to sing or play in coordination upon request, it was 
necessary to generate harmonies from solo recordings. We recorded two Tsimane’ 
vocalists singing the same melodic phrase several times in a row. The phrases 
were chosen by the vocalists to be Tsimane’ songs they knew well and tended to 
sing in social settings. We selected sets of phrases by each singer with very simi- 
lar timing (by visually comparing the waveforms), yielding 26 phrases with two 
repetitions each. We then pitch-shifted one of the phrases in a set and added it to 
the other (unshifted) version of the phrase. Pitch shifting was performed using 
STRAIGHT. The experiment presented harmonies with intervals ranging from 
1 to 11 semitones, as well as single phrases and the unison (two different repetitions 
of the same phrase at the same pitch). There were 4 exemplars of each interval, each 
generated from a different melodic phrase, yielding 52 trials in total. Each of the 
26 sung phrases was used twice in the experiment. We generated 15 random assign- 
ments of phrases to conditions with the constraint that the two occurrences of a 
phrase not be assigned to the same condition. One of these assignments was chosen 
at random for each participant. Although the song excerpts were not selected to 
be widely recognizable (they were simply those that the singers were comfortable 
singing), 19 out of the 50 participants were able to name at least one song that they 
recognized during the experiment. 

Pure tone interval stimuli (Study 2). Pure tone intervals were generated by the 
same procedure used for the smooth/rough synthetic tone stimuli from Study 1, 
the only difference being that the tones in a pair were separated by integer numbers 
of semitones. The experiment presented intervals from 0 to 8 semitones (in diotic 
and dichotic versions), each at 2 different root-note frequencies (523 and 740 Hz). 
The diotic and dichotic 0-semitone conditions were identical. 

Discrimination experiment stimuli/task (Study 2). Listeners heard harmonics 
1-6 of a fundamental frequency (FO). On one-quarter of trials the even-numbered 
harmonics were mistuned by +12%, rendering the tone inharmonic and causing 
the even and odd harmonics to segregate and be heard (by typical Western listen- 
ers) as two separate tones. On another quarter of trials the even harmonics onset 
150 ms earlier than the odd harmonics, again causing the even and odd harmonics 
to perceptually segregate. On the remaining 50% of trials the even harmonics were 
unperturbed and the complex tone tended to be heard as a single sound. The 
task was to judge whether a trial contained one or two sounds. After each trial 
the experimenter told the participant if they had answered correctly (the correct 
answer was considered to be “one” when the harmonics were unperturbed, and 
“two” when mistuned or offset). The tone FO took 9 different values (0, 4 and 8 
semitones above 200, 300 and 400 Hz), yielding 36 trials in total. Harmonics were 
750 ms in duration and were given flat amplitude envelopes that began and ended 
with half-Hanning windows (10 ms). Stimuli were presented at 75 dB SPL. 

The experiment was preceded by a practice experiment that was identical in 
all respects except that only 5 FO s were used (200, 282.8, 300, 400 and 424.3 Hz), 
yielding 20 trials in total. 

Stimulus presentation (Study 2). Same as in Study 1. 
Experimental protocol (Study 2). Pleasantness ratings were obtained as in Study 1, 
except that the experimenter entered responses directly into the computer. 

The experiments were completed in a single sitting. Experiment order was 

generated randomly for each participant, with the exception that the discrimi- 
nation experiments were conducted first or last with equal probability (because 
the instructions were different than for the pleasantness rating experiments). 
Experiments were conducted in July 2015 (Tsimane’) and July—October 2015 (US 
musicians). 
Analysis. For preference experiments, the ratings for each participant were aver- 
aged across the exemplars for a condition (for example, laughs or gasps for the 
vocalization experiment, or major triad for one of the chord experiments), yielding 
one mean rating per condition per participant per experiment. In the chord, tone, 
and vocal harmony experiments, most analyses were done after averaging these 
mean ratings across groups of conditions, for example, the consonant chords/ 
harmonies, or the harmonic tones, again yielding one mean rating for each partici- 
pant for each group of conditions in each experiment. The results graphs plot these 
mean ratings averaged across participants, and the error bars plot the standard 
error of these mean ratings across participants. 

For the discrimination experiment, we obtained one measure of sensitivity (d’) 
for each condition for each participant from their hit and false alarm rates. The 
results graphs plot the mean and standard error of this sensitivity measure across 
subjects. 

Statistics. Repeated-measures ANOVAs were used to test for main effects of 
stimulus condition and for interactions between the effect of stimulus condition 
and participant group. Mauchly’s test was used to test for violations of the sphe- 
ricity assumption. When Mauchly’s test was significant, the Greenhouse-Geisser 
correction was used. Pair-wise comparisons were made with two-tailed t-tests, 
and were not corrected for multiple comparisons because they were preceded by 
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ANOVAs that revealed significant main effects or interactions. Pearson correlations 
were used to determine the relationship between chord ratings and interval size 
(Extended Data Fig. 2) and to test the reliability of ratings of song excerpts and 
vocal harmonies (Extended Data Fig. 3). Data distributions were assumed to be 
normal, and were evaluated as such by eye. The exceptions to the above were in 
the analysis of individual melodic phrase ratings in the vocal harmony experiment 
(Extended Data Fig. 3) and of the pure tone interval experiment from Study 2 
(Fig. 3f), in which only two ratings per condition were collected per subject due 
to the large number of conditions (producing non-normal data distributions), 
necessitating non-parametric tests. In these cases, Friedman’s non-parametric test 
of differences among repeated measures was used to test for main effects, and the 
Wilcoxon signed rank test was used for pairwise comparisons. 

Tsimane’ music. Tsimane music consists of songs that have apparently been passed 
down for generations. At present these songs are sung primarily at social gather- 
ings by particular individuals who know the songs, typically when adults gather 
to consume “chicha’, a traditional homemade alcoholic beverage. The primary 
documentation of Tsimane’ music comes from the work of J. Riester and G. Rocckel 
(an anthropologist and ethnomusicologist, respectively), who compiled 140 tradi- 
tional songs and analysed their subject matter and musical content”’, We have also 
compiled Tsimane’ songs as part of a broader longitudinal study”. Known Tsimane’ 
songs tend to reflect their everyday lives, concerning production activities (such as 
hunting), division of labour, love, social organization, and religion. Musically, many 
songs are based on anhemitonic pentatonic scales. Others are constructed from 
three notes spanning a perfect fourth (typically divided into intervals of a whole 
tone and minor third). Examples of recorded songs are provided in Supplementary 
Information (Supplementary Audio 1 and 2). Instrumental renditions of songs 
are also common, typically on wind or string instruments. Drums exist but do not 
seem to be used to accompany other instruments or singing, at least not any more. 
There is one report of Tsimane’ performing traditional dances in the mid-twentieth 
century*®, but to the best of our knowledge, traditional dancing is no longer 
practiced. 

For our purposes, the most interesting feature of Tsimane’ music is that group 
performances appear to be absent. We substantiated this over several days of 
interviews with 10 Tsimane’ musicians. They consistently reported that musical 
performances often occur in social settings, but that only one person performs at 
atime. Multiple musicians might perform in succession, but not concurrently; 
instrumental accompaniment to singing is rare. We were initially sceptical of this 
claim, and so brought pairs of musicians together and asked them to perform 
together. They were usually reluctant to do so (despite being eager to perform solo 
songs for each other), and on the few occasions when we could elicit concurrent 
performances, they were unable to coordinate. Our experience suggests that group 
musical performance in Tsimane’ culture is rare at best. 

The origins of the Tsimane’ music tradition are unknown, but they clearly have 
had some contact with the music traditions of other cultures for at least the past 
century, including exposure to Western musical instruments, primarily through 
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friars and missionaries as well as through Bolivian highlanders. It is also clear that 
the Tsimane’ musical tradition is slowly fading from their culture, in part due to 
the influence of missionaries. For instance, shamans were once a staple guardian 
of Tsimane’ songs, but were banned by Protestant missionaries who became influ- 
ential in parts of the Tsimane territory*". The past two decades have also seen the 
diffusion of highland Andean music and Christian hymns into the Tsimane' terri- 
tory through the radio station of Protestant missionaries, nearby commercial radio 
stations, and highland migrants moving into the lowlands. As part of a longitudinal 
study with the Tsimane we found that of the 2,549 adults interviewed annually 
during 2002-2010 in 13 villages along the Maniqui River (varying in proximity 
to market towns and roads), 30% reported owning at least one battery-operated 
radio. These figures may overstate the impact of radios because some of the radios 
lacked batteries, or were broken. It is also unclear to what extent the Tsimane’ use 
their radios to listen to music, as they are also a primary source of local news. More 
recently, the Bolivian government has started to expand the electricity grid to the 
countryside surrounding the town of San Borja, and as a result, some villages 
have begun to acquire electricity. Our impression is that the effect of these outside 
influences has been to reduce the prevalence of traditional Tsimane’ music in their 
culture. Most Tsimane’ that we interviewed had knowledge of the traditional songs, 
but this might well diminish in the future. 
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WConsonant [Dissonant [_]Unison 
Synthetic Chords 
US-M US - NM Capital Town Tsimane' 


Like ke kK kkKK 
Dislike 
Sung Chords 


Like RRK KKK * * * 
Dislike 


xxxx P<.0001 * P<.05 


Extended Data Figure 1 | Average ratings of individual chords from the —_ consonant (blue) and dissonant (brown) chords. Ratings are from 


synthetic and sung chord experiments from Study 1. Ten chords were 23 US musicians (US-M), 25 US non-musicians (US-NM), 24 Bolivian 
presented: the unison, minor second, major second, major third, perfect city-dwellers (capital), 26 Bolivian town-dwellers, and 64 Tsimane. 
fourth, tritone, perfect fifth, major seventh, major triad, and augmented Asterisks denote statistical significance of a repeated-measures ANOVA 


triad. The composite ratings plotted in Fig. 2 were averages of those for the —_ across all chord ratings. Data are mean and s.e.m. 
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US - Musicians Tsimane' 


MConsonant §§Dissonant []Unison 


a Synthetic Chords (two-note) 
: 
Dislike 
012345678 91011 012345678 91011 


Interval (semitones) 


b Sung Chords (two-note) 


Like ene woe 
Dislike 


012345678 91011 
Interval (semitones) 


c Sung Chords (three-note) 
: rm 
Dislike 
Maj Min Aug Dim Maj Min Aug Dim 
Triad 
d Vocal Harmonies (using Tsimane' songs) 
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ek n.s. 
Dislike 
$01234567 8 91011 $01234567 8 91011 


Interval (semitones) 


+xx% P<.0001 n.s. not significant 


Extended Data Figure 2 | Average ratings of individual chords and vocal _ chord experiments in which they exhibited significant variation). 


harmonies from Study 2. Rating variation for US listeners (n = 47) was ‘S denotes a single vocal phrase in the vocal harmonies experiment, 
largely determined by consonance and dissonance, whereas for Tsimane’ whereas ‘0’ denotes the unison condition (two concurrently presented 
listeners (n= 50) it was largely driven by interval size (in the two-note exemplars of the same phrase at the same pitch). Data are mean and s.e.m. 
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Extended Data Figure 3 | See next page for caption. 
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Extended Data Figure 3 | Pleasantness ratings of vocal harmonies 
from Study 2, averaged across pitch intervals or across song excerpts, 
for US musicians and Tsimane’. a, Ratings of song excerpts used to 
create harmonies (averaged across intervals). Each participant heard 
each excerpt twice, in each of two different randomly assigned interval 
conditions. The ratings of the two excerpt occurrences were averaged 
for each participant (47 US musicians and 50 Tsimane’). Top panels plot 
these mean ratings averaged across all participants. Asterisks denote 
statistical significance of Friedman's non-parametric test of differences 
among repeated measures across song excerpts. Ratings of US listeners 
varied across song excerpts (x7(25) = 40.92, P= 0.02), as did those of 
Tsimane listeners (x7(25) = 49.01, P=0.003), but the preferences of 
the two participant groups were not significantly correlated (r= —0.19, 
P=0.35). Bottom panels plot the mean ratings of each excerpt averaged 
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across the first and last halves of the participants tested. The mean song 
excerpt ratings of the two half-groups were significantly correlated for 
Tsimane’ but not for US participants, indicating that the preferences in US 
listeners were not reliable. b, Ratings of pitch intervals of the harmonies 
(averaged across song excerpts). Each participant heard each pitch interval 
four times, each time generated with a different song excerpt. The ratings 
of the four interval occurrences were averaged for each participant. Top 
panels (replicated from Extended Data Fig. 2) plot these mean ratings 
averaged across all participants. Asterisks denote statistical significance 
of a repeated-measures ANOVA across all chord ratings. Bottom panels 
plot the mean rating of each pitch interval averaged across the first and 
last halves of the participants tested. The mean interval ratings of the 

two groups were significantly correlated for US but not for Tsimane’ 
participants. Data are mean and s.e.m. 
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a One vs. Two Tone b = Sung Chords Sung Chords Vocal Harmonies 
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Extended Data Figure 4 | Discrimination and preference data for This subset of Tsimane’ listeners remained indifferent to dissonance, 
worst US and best Tsimane’ participant subsets from Study 2. rating consonant and dissonant sung intervals, triads, and harmonies 
a, Discrimination performance for the worst US-M participants (bottom similarly (t(15) < 1.2, P > 0.28 in all cases). By contrast, the Western subset 
third, n = 16) and best Tsimane’ participants (top third, n = 16), selected yielded significant consonant preferences (t(15) > 2.65, P< 0.05 in all 
based on performance in the onset asynchrony condition. This selection cases). These results suggest that the absence of a consonance preference 
criterion produced a group of Tsimane’ listeners who achieved an average cannot be explained by a lack of sensitivity to the underlying stimulus 
d@ of 2.0 on the mistuned condition—good performance in absolute distinction—the Tsimane’ were able to distinguish harmonic from 
terms, and only slightly worse than the poorly performing Western inharmonic tones despite not having a preference for one over the other. 
listeners. b, Pleasantness ratings for the subsets of participants from a for Data are mean and s.e.m. 


conventionally consonant and dissonant sung chords and vocal harmonies. 
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Extended Data Table 1 | Summary of participant group demographics for Study 1 
Participant Summary Table for Study 1 


US US LaPaz SanBorja Santa Maria 
(Musicians) (Non-musicians) (Capital) (Town) (Tsimane’ village) 
N total 23 25 24 26 64 
N female 19 12 14 15 31 
Mean age 20.2 37.9 29.9 31.6 31.5 
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Extended Data Table 2 | Summary of participant group demographics for Study 2 
Participant Summary Table for Study 2 


US Tsimane’ 
(Musicians) 
N total 47 50 
N female 32 27 
Mean age 25.9 27.4 
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Transfer of mitochondria from astrocytes to neurons 


after stroke 


Kazuhide Hayakawa!, Elga Esposito!, Xiaohua Wang!”, Yasukazu Terasaki!, Yi Liu?, Changhong Xing!, Xunming Ji? & Eng H. Lo! 


Neurons can release damaged mitochondria and transfer them 
to astrocytes for disposal and recycling’. This ability to exchange 
mitochondria may represent a potential mode of cell-to-cell 
signalling in the central nervous system. Here we show that 
astrocytes in mice can also release functional mitochondria that 
enter neurons. Astrocytic release of extracellular mitochondrial 
particles was mediated by a calcium-dependent mechanism 
involving CD38 and cyclic ADP ribose signalling. Transient 
focal cerebral ischaemia in mice induced entry of astrocytic 
mitochondria into adjacent neurons, and this entry amplified 
cell survival signals. Suppression of CD38 signalling by short 
interfering RNA reduced extracellular mitochondria transfer 
and worsened neurological outcomes. These findings suggest a 
new mitochondrial mechanism of neuroglial crosstalk that may 
contribute to endogenous neuroprotective and neurorecovery 
mechanisms after stroke. 

Astrocytes have broad roles in the central nervous system, and are 
involved in the regulation of neurodevelopment, neurotransmission, 
cerebral metabolism and blood flow”*. Normal astrocytes protect neu- 
rons against oxidative stress and excitotoxicity’. By contrast, unhealthy 
astrocytes may release deleterious factors that damage neurons®’. 
Healthy mitochondria may be essential for these neuroglial protective 
mechanisms because inhibition of astrocytic mitochondria makes neu- 
rons vulnerable to cell death!°. Mitochondria comprise the intracellular 
cores for energetics and viability!!, but under some conditions mitochon- 
dria might also be released into the extracellular space'”. For example, 
retinal neurons may transfer mitochondria to astrocytes for disposal 
and recycling’, and bone-marrow-derived stromal cells may transfer 
mitochondria into pulmonary alveoli to suppress acute lung injury!?. 

Here we investigated whether astrocytes can produce functional 
extracellular mitochondria to support neuronal viability after ischaemic 
stroke. Electron microscopy confirmed the presence of extracellular 
particles containing mitochondria in conditioned media from rat 
cortical astrocytes (Fig. 1a, Extended Data Fig. la). qNano analysis 
revealed that astrocyte-derived mitochondrial particles isolated by 
fluorescence-activated cell sorting (FACS) spanned a range of sizes 
from 300 to 1,100nm (Extended Data Fig. 1b—-d), and included pop- 
ulations that were positive for 31-integrin (79%) and CD63 (43%) 
(Extended Data Fig. 2). MitoTracker labelling suggested that these 
extracellular mitochondria may still be functional (Fig. 1b), and 
filtration of astrocyte-derived conditioned media through 0.2-m 
filters depleted the amounts of functional mitochondria and reduced 
measurements of mitochondrial ATP, membrane potential and oxygen 
consumption (Fig. 1b-e). 

An important question is whether extracellular mitochondria 
represent active signals or merely cellular debris. To address this, 
we investigated whether stimulated astrocytes could actively pro- 
duce extracellular mitochondria. CD38 catalyses the synthesis of a 
calcium messenger, cyclic ADP-ribose (cADPR), in mitochondrial 
membranes!*!, In the brain, CD38 is mainly expressed in glial cells, 


and may have a role in neuroglial crosstalk since astrocytes increase 
CD38 expression in response to glutamate release from neurons’®. On 
the basis of this background literature and the fact that most actively 
secreted cellular events involve calcium regulation, we decided to 
assess CD38-cADPR-calcium signalling as a candidate mechanism 
for the astrocytic production of extracellular mitochondria. First, we 
confirmed that rat cortical astrocytes expressed CD38 protein and 
contained CD38 or ADPR cyclase activity (Fig. 1f, g). We then used 
two methods to modify this pathway. When astrocytic CD38 was 
upregulated using CRISPR/Cas9 activation plasmids, the functional 
endpoints of extracellular mitochondria were significantly increased 
in conditioned media (Fig. 1h-k). When astrocytes were stimulated by 
cADPR to activate CD38 signalling, extracellular mitochondria were 
increased in conditioned media along with enhancement of functional 
endpoints in a calcium-dependent manner (Fig. 1]-n, Extended Data 
Fig. 3). Stimulation with cADPR did not appear to damage astrocyte 
viability (Fig. 10), suggesting that this release of extracellular mitochon- 
dria was not due to nonspecific cytotoxicity. 

If astrocytes can produce functional extracellular mitochondria, it 
is possible that these signals could affect adjacent neurons. When rat 
cortical neurons were subjected to oxygen-glucose deprivation, intra- 
cellular ATP levels fell and neuronal viability decreased, as expected 
(Fig. 2a—c, Extended Data Fig. 4a). When astrocyte-derived conditioned 
media containing extracellular mitochondrial particles was added 
to neurons, ATP levels increased and neuronal viability recovered 
(Fig. 2a—c, Extended Data Fig. 4a). However, when extracellular 
mitochondria were removed from the astrocyte-conditioned media, 
neuroprotection was no longer observed (Fig. 2a—c, Extended Data 
Fig. 4a). Similar results were obtained with immunostaining-based cell 
counts (Fig. 2d). As a control, ATP-liposomes were not significantly 
protective (Fig. 2e), suggesting that the entry of astrocytic mitochondria 
into neurons may generate additional benefits beyond ATP energetics 
per se. Fluorescent microscopy confirmed that astrocyte-derived 
mitochondria were present within treated neurons (Fig. 2f and 
Extended Data Fig. 4b). 

Beyond the prevention of acute neuronal death, delayed neuroplas- 
ticity is also important for stroke outcomes. CD38 may be important 
for brain plasticity because CD38-deficient mice show worsened 
recovery after brain injury’? and CD38 mutations may comprise 
risk factors for behavioural dysfunction'®. Hence, we asked whether 
CD38-mediated astrocyte-to-neuron mitochondrial transfer may also 
influence neuroplasticity. Neurons were labelled with the green 
fluorescent protein (GFP)-containing construct CellLight 
Mitochondria-GFP and astrocytes were separately labelled with the red 
fluorescent probe MitoTracker Red CMXRos, and the two cell types were 
co-cultured together for 24h. Confocal microscopy indicated that 
astrocyte-derived mitochondria were detected within soma and axon 
(Fig. 3a), and in these co-culture conditions, astrocytes supported 
neuronal survival after serum/glucose starvation in a CD38-dependent 
manner (Extended Data Fig. 5). When astrocytic mitochondria 


1Neuroprotection Research Laboratory, Departments of Radiology and Neurology, Massachusetts General Hospital and Harvard Medical School, Charlestown, Massachusetts 02129, USA. 
Cerebrovascular Research Center, Xuanwu Hospital, Capital Medical University, Beijing 100053, China. 


28 JULY 2016 | VOL 535 | NATURE | 551 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a b c 7 d e f ¢ 
y F -o ACM (9) 
) ACM. o P<0.05 0.404 P < 0.05 “e. mdACM (6) SF es 
} 36:5 %. Ee 1.2 pa w 
|g < ‘) I c 124 P<0.01 
; Fs © 0.35 ie} 
32 1.0 S = 
i gy, BE og eo g 
q ° § ~ 0.25 5 
' 5 0.6 8 : - 
i cu 
0.4 0.20 ° 
@ @ @ a 
H ; 
‘Mi rochondria 1 co o co & pa ; re | b-actin 
Mitochondria < < ime (min) 
g9 > , i j k ~©-Control plasmid (5) 
= 57 -O- Neuron () Rat cortical B 7, P<0.05 gq P< 0.05 -~@ CD38 activation plasmid (5) 
8 4 | -@ Astrocyte (6) astrocyte (~80%) = ‘ re) e a 
2 © 6 £ 3 = * P<0.01 
i % 1.5 
o 3 85 + Ee 2 a 
oO 5 o é 
ar Transfection > 4 + e R 2 E 13 
a 24h ina 3 . a 1. 
<1 CD38 cyclase activity a 3 8 4 7s 8 12 
8 y 2 Se £ Oo 14 
8 24h 8 i 
= teh ATP/O, 3 O Control plasmid (5) SS pe Sane 
; ; 9 ontrol_plasmi : Po HPO?A POS 
Time (min) consumption @ CD38 activation plasmid (5) g Time (min) 
t 100 m 0.54 P< 0.05 MN 495, P<0.01 © 150 
= P<0.05 s gS 
ae a § > 100 = 
5.8 ec S zZ 
== 60 5 0.3 Q = 
oo © —E 2a 
85 S x S © 
go 40 20.2 2 = 50 
cane) I re = 
a8 20 8 04 8 3g 
€ 0 7 oS ~O- Control (4) o- 
8) (3) (7) (2%) -@ cADPR (4) (4) (4) (4) (4) 
& & & 2% 094 ae 
; i Ss Ss « AY ND A 
Mitochondria Se SS < Time (min) cADPR (uM) 


Figure 1 | Astrocytic CD38 and extracellular mitochondria. 

a, Transmission electron microscopy (TEM) of extracellular mitochondria 
in astrocyte-derived conditioned medium (ACM). Scale bar, 500 nm. 

b, Rat cortical astrocytes were labelled with MitoTracker Red CMXRos. 
FACS showed that filtration through 0.2-j.m filters depleted levels of 
extracellular mitochondria in ACM (mitochondria-depleted ACM; 
mdACM). c-e, 0.2-\1m filters reduced several markers of extracellular 
mitochondrial function in ACM: extracellular ATP (c, n= 4 independent 
experiments), membrane potential (JC1 ratio; d, n= 4 independent 
experiments), and oxygen consumption (e, n = 3 biological replicates, 
n=9 or 6 independent experiments). f, Western blot confirmed higher 
CD38 levels in rat cortical astrocytes than in neurons. g, High and low 
levels of CD38 cyclase activity in astrocytes and neurons, respectively 
(n=2 biological replicates, n = 8 or 5 independent experiments). 

h, Experimental schematic for testing CRISPR/Cas9-mediated CD38 
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activation. NAD, 3-nicotinamide adenine dinucleotide sodium salt (200M). 
i, Twenty-four hours after transfection, CD38 cyclase activity was 
upregulated by CD38 activation plasmid (n = 2 biological replicates, n =5 
independent experiments). j, k, Extracellular ATP production (j) and 
oxygen consumption (k) were significantly increased by CD38 activation 
(n=2 biological replicates, n = 5 independent experiments). 1, FACS 
showed that extracellular mitochondria were increased by cADPR (11M) 
stimulation in astrocytes (n = 3 independent experiments). m, cADPR 
(1M) increased extracellular mitochondria membrane potential at 24h 
(n= 2 biological replicates, n = 7 independent experiments). n, Oxygen 
consumption in extracellular mitochondria was increased by cADPR 

(n= 2 biological replicates, n = 4 independent experiments). 0, cADPR did 
not cause astrocyte toxicity (n = 2 biological replicates, n = 4 independent 
experiments). Data are mean + s.e.m. P values are from an unpaired t-test 
(c, d, i, j, 1, m) or two-way analysis of variance (ANOVA) (e, k, n). 


Figure 2 | Astrocytic extracellular mitochondria and neuroprotection. 
a, Experimental schematic to test neuroprotective effects of ACM or 
mdACM against oxygen-glucose deprivation (OGD) in rat cortical 
neurons. WST, water-soluble tetrazolium salt dye for cell viability assay. 

b, ACM but not mdACM rescued ATP levels in damaged neurons 

(n= 2 biological replicates, n = 4 independent experiments). c, ACM but 
not mdACM recovered neuronal viability after OGD (n= 2 biological 
replicates, n = 4 independent experiments). d, Immunostaining confirmed 
that neuroprotective effect of ACM but not mdACM (n= 2 biological 
replicates, n = 4 independent experiments). NeuN and MAP2 are neuronal 
markers. Scale bars, 100,1m. e, No statistically significant neuroprotection 
with liposomal ATP (1-1,000 nM) after OGD. f, Fluorescent microscopy 
suggests the presence of astrocyte mitochondria (labelled with 
MitoTracker Red CMXRos, 200 nM) within neurons. Scale bars, 100|1m. 
Data are mean + s.e.m. P values are from a one-way ANOVA followed 

by Tukey’s test. 
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Figure 3 | Astrocytic mitochondria and neuroplasticity after ischaemic 
stress. a, Confocal microscopy revealed that astrocytic mitochondria 
(red, MitoTracker Red CMXRos) may be transferred into neural soma 

(1) and axon (2), and some may fuse with neuronal mitochondria 

(3; green, CellLight Mitochondria-GFP). b, Experimental schematic 

for co-culture studies. c, Soma size was unchanged but astrocytic 
mitochondrial density in neuronal soma was significantly decreased 
when CD38 was suppressed in astrocytes (n = 54 or 41 soma from n =2 
biological replicates, n = 3 independent experiments were counted). Scale 
bars, 201m. d, Quantification of dendrite elongation (MAP2 staining) 
(n=2 biological replicates, n= 5 or 6 independent experiments). e, Male 
C57BL/6J mice were subjected to 60 min transient focal ischaemia. Three 
days later, astrocyte mitochondrial particles (1,000 particles per 2 \1l, 
MitoTracker Red CMXRos) were infused into cerebral cortex. Z-stack 


were made dysfunctional via inhibition of mitochondrial aconitase, 
cADPR-stimulated astrocytes no longer supported neuronal survival 
and axonal extension (Extended Data Fig. 6). To assess our hypoth- 
esis further, we asked whether this ability of astrocytes to transfer 
mitochondria could in fact enhance neuroplasticity under pathological 
conditions. Control or CD38-silenced astrocytes were co-cultured with 
surviving neurons after oxygen-glucose deprivation (Fig. 3b). Short 
interfering RNA (siRNA) suppression of CD38 in astrocytes reduced 
mitochondrial transfer (Fig. 3c) and dendrite regrowth after injury 
(Fig. 3d). 

Taken together, these cellular findings appear consistent with the 
overall hypothesis that CD38 signalling may help astrocytes to transfer 
mitochondria into neurons and promote survival and plasticity after 
injury. But it is unclear whether this mechanism works in vivo. To address 
this question, we turned to a mouse model of focal cerebral ischaemia. 
First, primary mouse cortical astrocyte cultures were labelled with 
MitoTracker Red CMXRos and extracellular mitochondrial particles 
were collected. Mice were then subjected to focal cerebral ischaemia, 
and 3 days later, extracellular mitochondrial particles were directly 
injected into the peri-infarct cortex. After 24h, immunostaining 
suggested that transplanted astrocytic mitochondria were present in 
neurons (Fig. 3e). Next, we turned to FVB/N-Tg (GFAPGFP)14Mes/J 
transgenic mice in which astrocytes are fluorescently labelled. When 
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images showed transplanted astrocytic mitochondria (red) within peri- 
infarct neurons at 24h. f, FVB/N-Tg (GFAPGFP)14Mes/J transgenic mice 
with fluorescently labelled astrocytes were subjected to 30 min transient 
focal ischaemia. Immunohistochemistry at 24h suggested that GFP 
(GFAP)-positive particles co-stained with mitochondrial TOM40 were 
present in MAP2-positive neurons in peri-infarct cortex. g, Western blot 
indicated that GFP-positive neurons upregulated cell-survival-related 
proteins (phosphorylated-AKT, BCL-XL) but not apoptosis-related 
proteins (caspase 3, AIF) along with an increase in mitochondrial TOM40 
(n=3 mice). Isolated neurons expressed mature (neurofilament) but not 
neural stem-cell markers (nestin) (see Extended Data Fig. 7f). Data are 
mean + s.e.m. P values are from an unpaired t-test (c) or one-way ANOVA 
followed by Tukey’s test (d). 


these mice were subjected to focal cerebral ischaemia, fluorescent 
mitochondrial particle signals appeared within adjacent neurons at 
24h after stroke (Fig. 3f). Neurons that were collected from ischaemic 
peri-infarct cortex via flow cytometry showed a general upregulation of 
cell-survival-related signals such as phosphorylated AKT and BCL-XL, 
along with an increase in the mitochondria marker TOM40 (Fig. 3g, 
Extended Data Fig. 7). 

Finally, we attempted loss-of-function experiments to determine 
whether blocking CD38 signalling results in worsened outcomes after 
stroke. In our mouse model of focal cerebral ischaemia, CD38 was 
upregulated in the peri-infarct cortex (Extended Data Fig. 8a). At 5 
days after stroke, Cd38 siRNA or control siRNA was injected into 
cerebral ventricles (Fig. 4a). By 2 days after siRNA injections, total 
CD38 expression in the peri-infarct cortex was successfully downregu- 
lated (Extended Data Fig. 8b). There were no clear differences in infarct 
area or the total levels of GFAP-positive reactive astrocytes (Fig. 4b, c), 
but astrocyte subsets that expressed CD38 were significantly 
decreased without affecting the number of other CD38-expressing 
cells such as CD8 T cells and microglia/macrophages!? (Fig. 4c, 
Extended Data Fig. 8c-g). To assess the levels of extracellular mito- 
chondrial particles in this in vivo model, flow cytometry was used 
to analyse cerebrospinal fluid (CSF). GFAP-positive mitochondria 
were detected in CSF, and Cd38 siRNA injections appeared to reduce 
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Figure 4 | Effects of Cd38 siRNA on focal cerebral ischaemia. a, Male 
C57BL/6J mice were subjected to transient 60 min focal ischaemia, 

and control or Cd38 siRNA was injected into lateral ventricles 5 days 

after stroke. MCAO, middle cerebral artery occlusion. b, Nissl staining 
showed no difference in infarct size (n = 8 or 10 mice per group). NS, not 
significant. c, Immunostaining demonstrated that astrocytic CD38 was 
diminished by Cd38 siRNA. d, Astrocytic CD38 suppression by siRNA 
reduced GFAP-positive mitochondria in CSF at 7 days (n = 6 mice per 
group). e, Neuronal mitochondria were decreased by Cd38 siRNA 

(n=8 or 5 mice per group). f, Cd38 siRNA attenuated peri-infarct GAP43 
immunostaining. g, Western blot confirmed a reduction of peri-infarct 
GAP43 protein within Cd38-siRNA-treated brains (n =5 mice per group). 


this extracellular population of astrocyte-derived mitochondria 
(Fig. 4d). At the same time, flow cytometry was used to quantify 
levels of MAP2 neuronal mitochondria (Extended Data Fig. 9). Brains 
treated with Cd38 siRNA showed a significant reduction in neu- 
ronal mitochondria (Fig. 4e), suggesting that interfering with CD38 
signalling may have suppressed endogenous astrocyte-to-neuron 
mitochondrial transfer. These effects were accompanied by a reduc- 
tion in peri-infarct GAP43 (a surrogate marker of neuroplasticity; 
Fig. 4f, g) as well as worsened neurological outcomes (Fig. 4h, i). 
Furthermore, CD38 suppression significantly decreased oxygen con- 
sumption measurements in CSF-derived extracellular mitochondrial 
particles (Fig. 4j), and neurological outcomes seemed to be negatively 
correlated with these functional endpoints (Fig. 4k, 1), suggesting 
that CSF mitochondrial function may be a potential biomarker of 
neuroglial signalling after stroke. 

Taken together, these findings suggest that astrocytes may release 
extracellular mitochondrial particles via CD38-mediated mechanisms 
that enter neurons after stroke (Fig. 4m). However, there are a few caveats 
and the detailed mechanisms and generalizability of these proof- 
of-concept findings should warrant further investigation. First, the 
dynamics of extracellular mitochondria release and entry into neurons 
as well as quantitative thresholds for functional benefit remain to be 
fully defined (Extended Data Fig. 10a-i). A second caveat relates to 
mitochondrial entry mechanisms. In neurons, endocytosis may be 
regulated by dynamin/clathrin”’ or integrin pathways”". In our models, 
integrin-mediated Src/Syk signalling may be involved (Extended 
Data Fig. 10j-m). How integrin-mediated mitochondrial transfer 
is modulated under different disease conditions requires further 
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C, contralateral hemisphere; I, ipsilateral hemisphere. h, i, Suppression 
of CD38 signalling worsened neurological outcomes in neurological 
severity scores (neuroscore) (h) and grid-walking test (i) (n=7 or 9 mice 
per group). *P < 0.05 versus day 3 control siRNA; #P < 0.05 versus day 7 
Cd38 siRNA. j, CD38 suppression decreased oxygen consumption in CSF 
mitochondria (n =7 or 9 mice per group). k, 1, Mitochondrial function 
in CSF was negatively correlated with neurological outcomes. Data are 
mean + s.e.m. P values are from an unpaired t-test (b, d, e, i), one-way 
ANOVA followed by Tukey’s test (g), repeated-measures two-way ANOVA 
followed by Bonferroni test (h), two-way ANOVA (j), or Spearman's rank 
correlation (k, 1). m, Schematic of CD38 regulation of mitochondria 
release/transfer hypothesis between astrocytes and neurons. 


study. Third, CD38 is also expressed in immune cells. In this study, 
CD38 suppression by siRNA in vivo did not appear to affect T cells or 
microglia/macrophages, but the balance between potentially beneficial 
CD38 signals in astrocytes versus deleterious CD38 signals in immune 
cells should be carefully considered. A fourth caveat is whether other 
glial cells may participate. Microglia, oligodendrocytes and pericytes 
are activated after stroke?*”’, so their potential roles in mitochondrial 
exchange warrant further investigation. Finally, astrocytes can pro- 
duce many factors for protecting and restoring neurons, including 
tPA, high-mobility group box 1 (HMGB1), extracellular microvesicles 
containing VEGF and FGF-2, and various microRNAs7* 2’. How mito- 
chondrial particles may interact with these other extracellular signals 
should be explored. 

Non-cell autonomous signalling is vital for central nervous system 
recovery after injury or disease”*”?, In the context of cerebral ischae- 
mia, the present study suggests that astrocytes may release extracellular 
mitochondrial particles that enter neurons to support cell viability and 
recovery after stroke. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

No statistical methods were used to predetermine sample size. 

Reagents. BAPTA-AM (A1076), cADPR (C7344), B-nicotinamide adenine 
dinucleotide sodium salt (N0632) and dynasore hydrate (D7693) were purchased 
from Sigma, and RGDS peptide (3498) and MNS (2877/50) were from R&D systems. 
Mouse focal cerebral ischaemia model. All experiments were performed 
following an institutionally approved protocol in accordance with National 
Institutes of Health guidelines and with the United States Public Health Service's 
Policy on Human Care and Use of Laboratory Animals. Our methods also 
included randomization, blinding and statistical criteria consistent with ARRIVE 
guidelines (Animals in Research: Reporting In vivo Experiments). In brief, male 
C57BL/6J (12-14 weeks) or FVB/N-Tg (GFAPGFP) 14Mes/J mice are anaesthe- 
tized with 5% to 1% isoflurane, and rectal temperatures and cerebral blood flow are 
monitored. After midline skin incision, 7-0 nylon monofilament coated with 
silicon resin was introduced through a small incision into the common carotid 
artery. Adequate cerebral ischaemia was assessed by Laser Doppler flowmetry 
and by examining forelimb flexion after the mice recovered from anaesthesia. The 
mice were reanaesthetized, and reperfusion was established by withdrawal of the 
filament. Functional outcome after stroke was assessed by neurological severity 
scores and the foot-fault test”. 

Primary neuron cultures. Primary neuron cultures were prepared from cerebral 
cortices of embryonic day (E)17 Sprague-Dawley rat embryos or E17 FVB/N-Tg 
(GFAPGFP)14Mes/J mouse embryos. In brief, cortices were dissected and disso- 
ciated using papain dissociation system (Worthington Biochemical Corporation, 
LK003150). Cells were spread on plates coated with poly-p-lysine (Sigma, P7886) 
and cultured in DMEM (NBM, Life Technology, 11965-084) containing 25 mM 
glucose, 4mM glutamine, 1 mM sodium pyruvate, and 5% FBS at a density of 
2x 10° cells ml“! (1 ml for 12-well format, 0.5 ml for 24-well format). At 24h after 
seeding, the medium was changed to Neurobasal medium (Invitrogen, 21103-049) 
supplemented with B-27 (Invitrogen, 17504044) and 0.5 mM glutamine. Cells were 
cultured at 37°C in a humidified chamber of 95% air and 5% CO>. Cultures were 
used for experiments from 7 to 10 days after seeding. 

Primary astrocyte cultures. Primary astrocyte cultures were prepared from 
cerebral cortices of 2-day-old neonatal Sprague-Dawley rats or E17 C57BL/6J 
mice. In brief, dissociated cortical cells were suspended in DMEM (Life Technology, 
11965-084) containing 25 mM glucose, 4mM glutamine, 1 mM sodium pyruvate, 
and 10% FBS and plated on uncoated 25 cm’ flasks at a density of 6 x 10° cells cm~?. 
Monolayers of type 1 astrocytes were obtained 12-14 days after plating. Non- 
astrocytic cells such as microglia and neurons were detached from the flasks by 
shaking and removed by changing the medium. Astrocytes were dissociated by 
trypsinization and then reseeded on uncoated T75 flasks. After the cells reached 
70-80% confluence, cultures were switched to Neurobasal medium containing 
1% penicillin/streptomycin or DMEM containing 1% penicillin/streptomycin, 
and astrocyte-conditioned media (ACM) were collected 24h later. Collected ACM 
was treated by spin cell debris down with centrifuging at 2,000g for 10 min or by 
filtrating through a 1.2-\1m syringe filter for further experiments. 

OGD and reoxygenation. OGD experiments were performed using a specialized, 
humidified chamber (Heidolph, incubator 1000, Brinkmann Instruments) kept at 
37°C, which contained an anaerobic gas mixture (90% No, 5% Ho, and 5% CO). 
To initiate OGD, culture medium was replaced with deoxygenated, glucose-free 
DMEM (Life Technology, 11966-025). After 2h challenge, cultures were removed 
from the anaerobic chamber, and the OGD solution in the cultures was replaced 
with maintenance medium. Cells were then allowed to recover for 18h (for 
neurotoxicity assay) and 72h (for siRNA/astrocyte coculture) in a regular incubator. 
Cell viability assays. Neuronal injury was measured by standard cell cytotoxicity 
assays such as lactate dehydrogenase (LDH) using the Cytotoxicity Detection Kit 
(Roche Applied Science, 11644793001) and/or Cell Counting Kit 8 cytotoxicity 
assay (DOJINDO, CK04-13). For the LDH assay, 100% cell death was induced with 
0.5% Triton X-100 in sister culture. The relative assessments of neuronal injury 
were normalized by comparison with 100% cell death (LDH assay) or with control 
cell as 100% cell survival (CCK8). 

Determination of CD38/ADPR-cyclase activity. ADPR cyclase activity was 
determined fluorometrically using nicotinamide guanine dinucleotide (NGD*) 
(Sigma, N5131) as a substrate as described previously*”*!. Astrocytes or neurons 
were incubated with 200\1M NGD*, and the production of cGDPR was determined 
at excitation/emission wavelengths of Ex 300nm/Em 410 nm with a microplate 
reader. 

ATP measurement. Intracellular or extracellular ATP was determined by 
CellTiter-Glo luminescence kit (Promega, G7570), which can perform cell lysis 
and generate a luminescent signal proportional to the amount of ATP present. In 
brief, opaque-walled 96-well plates with culture media (5011) or cell lysate (50 il) 
were prepared. CellTiter-Glo luminescence test solution (5011) was added and 


incubated for 30 min at room temperature. Luminescent signal was determined 
by luminescence microplate reader. 

Liposomal ATP treatment. Liposomal ATP was obtained from Encapsula 
NanoScience. In brief, we used lyophilized proliposomes compose of 7:3 molar 
ratio of L-c-phosphatidylcholine: L-a-phosphatidylserine containing ATP which 
forms 100 nm liposomal ATP upon hydration. ATP-loaded liposome (1-1,000nM) 
was co-incubated with neurons following OGD, and cell viability was analysed 
after 18h reoxygenation. 

Mitochondria membrane potential measurement. To monitor mitochondrial 
health, JC-1 dye (Invitrogen, T-3168) was used to assess mitochondrial membrane 
potential. Rat cortical astrocytes or media were incubated with JC1 (541M or 11M) 
for 30 min at 37°C. JC1 dye exhibits potential-dependent accumulation in mito- 
chondria, indicated by fluorescence emission shift from green (Ex 485 nm/Em 
516nm) to red (Ex 579 nm/Em 599 nm). Mitochondria membrane potential was 
determined by the fluorescent ratio with a fluorescent microplate reader. 
Oxygen consumption analysis. Real-time oxygen consumption in astrocytic par- 
ticles or in CSF samples was measured by the Mito-ID Extracellular O2 sensor kit 
(Enzo Life Science, ENZ-51045) according to the instruction provided Enzo Life 
Science. In brief, astrocytes (70-80% confluent cells per well per 10011) or particle 
fractions (100.1; 25-fold concentrated astrocytic conditioned media) were pre- 
pared in non-coated regular 96 wells, and O2 sensor probe (10,11) was added into 
each well. Each CSF sample (8-20 i) was collected from cisterna magna at day 7 
after focal cerebral ischaemia. After centrifugation at 2,000g for 10 min, 6 jl CSF 
was diluted in 54,11 PBS, and 611 O2 sensor probe was added into each well. After 
covering with 100 il (501 for CSF sample) of Mito-ID HS Oil, the plates were read 
with filter combination of 340 nm for excitation and 642 nm of emission at 30°C. 
Electron microscopy analysis. Rat cortical astrocytes or pellets from astrocyte- 
conditioned media were fixed in 2.0% glutaraldehyde in 0.1 M sodium cacodylate 
buffer, pH 7.4 (Electron Microscopy Sciences) for 1h at room temperature on a 
rocker. They were rinsed in cacodylate buffer, gently scraped and pelleted and 
post-fixed in 1.0% osmium tetroxide in cacodylate buffer for 1h on ice. They were 
rinsed in buffer and stabilized with a small amount of 2% agarose in PBS to hold 
them together. They were then dehydrated through a graded series of ethanol to 
100%, followed by propylene oxide, 100%. They were infiltrated with Epon resin 
(Ted Pella) in a 1:1 solution of Epon:propylene oxide overnight on a rocker at room 
temperature. The next day they were placed in fresh Epon for several hours and then 
embedded in Epon overnight at 60°C. Thin sections were cut on a Leica EM UC7 
ultramicrotome, collected on formvar-coated grids, stained with uranyl acetate and 
lead citrate and examined in a JEOL JEM 1011 transmission electron microscope 
at 80kV. Images were collected using an AMT digital imaging system (Advanced 
Microscopy Techniques). These methods are similar to previous descriptions of 
extracellular particle and mitochondria detection in astrocyte cultures”. 

FACS analysis. Standard FACS analysis was performed by BD LSR I] or BD 
Fortessa as described previously*****?, ACM was collected from rat cortical 
astrocytes labelled with MitoTracker Red CMXRos followed by filtrating through 
a 1.2-\1m syringe filter. The supernatant was used to sort labelled mitochondria 
fraction by FACSAriall cell sorter configured with 561 nm air cooled laser. Brain 
cells were collected from peri-infarct cortex after stroke. In brief, tissues are gently 
minced and then digested at 37°C for 30 min with an enzyme cocktail (collagenase 
type I, DNase I, Sigma Aldrich). CSF samples were prepared for further staining 
after centrifugation at 2,000g for 10 min. FACS analysis was performed using an 
unstained or phenotype control for determining appropriate gates, voltages, and 
compensations required in multivariate flow cytometry. 

Measurement of particle size. Particle size following extracellular mitochondria 
isolation by FACS was determined by qNano (iZON). Nanopore-based detection 
allows particle-by-particle assessment of complex mixtures. Optimization of pore 
size to particle size, by adjusting the stretch of the pore, allows highly accurate 
measurement™. Particles containing mitochondria were sorted using FACS anal- 
ysis, then particle sizes were quantified by NP400 and using CPC400 calibration 
particle. 

Western blot analysis. Western blot was performed as previously reported”>. Pro- 
PREP Protein Extraction Solution (iNtRON Biotechnology, 17081) or Qproteome 
FFPE Tissue kit (Qiagen, 37623) was used to collect samples. Each sample was 
loaded onto 4-20% Tris-glycine gels. After electrophoresis and transferring to 
nitrocellulose membranes, the membranes were blocked in Tris-buffered saline 
containing 0.1% Tween 20 and 0.2% I-block (Tropix, T2015) for 90 min at room 
temperature. Membranes were then incubated overnight at 4°C with the following 
primary antibodies, anti-6-actin (1:1,000, Sigma-Aldrich, A5441), anti-GFAP 
antibody (1:1,000, BD Biosciences, 556328), anti- MAP2 antibody (1:500, Abcam, 
ab11267 or ab32454), anti-CD38 antibody (1:500, Santacruz, sc-7049), anti- 
TOM4O0 (1:200, Santacruz, sc-11414), anti-phosphorylated AKT (1:500, Cell 
Signaling, 9271), anti-BCL-XL (1:500, Cell Signaling, 2764), anti-active caspase 3 
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(1:200, Abcam, ab32042), anti-AIF (1:500, Abcam, ab32516), and anti-GAP43 (1:500 
Santacruz, sc-17790). After incubation with peroxidase-conjugated secondary 
antibodies, visualization was enhanced by chemiluminescence (GE Healthcare, 
NA931 (anti-mouse), NA934 (anti-rabbit), or NA935 (anti-rat)). Optical density 
was assessed using the NIH Image analysis software. 

Immunocytochemistry and immunohistochemistry. Immunocytochemistry 
and immunohistochemistry were performed as described previously*>”*. After 
staining with primary antibody and fluorescent-tagged secondary antibody, nuclei 
were counterstained with or without 4,6-diamidino-2-phenylindole (DAPI), and 
coverslips were placed. Immunostaining images or time lapse images were obtained 
with a fluorescence microscope (Nikon ECLIPSE Ti-S) interfaced with a digital 
charge-coupled device camera and an image analysis system or confocal micro- 
scope analysis using Carl Zeiss Laser Scanning Confocal Microscope Pascal 5 LSM 
and Pascal 5 LSM software Version 3.2. Dendrite elongation was assessed following 
MAP? staining followed by NeuriteQuant analysis*”. 

CRISPR activation plasmid transfection. Control CRISPR activation plasmid 
(sc-437275), and rat CD38 CRISPR activation plasmid (sc-437321-ACT) were 
obtained from Santa Cruz Biotechnology. Transfection was performed according 
to the transfection protocol for cell cultures from Santa Cruz Biotechnology. In 
brief, 1 ml of plasmid transfection reagent mixture (transfection reagent, sc-395739, 
transfection medium, sc-108062) was co-incubated with astrocytes for 24h in a 5% 
CO), incubator at 37 °C, and then CD38 cyclase activity was assessed to confirm 
efficiency of transfection. 

siRNA experiments. Control siRNA and Cd38 siRNA were obtained from Santa 
Cruz Biotechnology. Control siRNA (sc-37007) consists of a scrambled sequence 
that will not lead to the specific degradation of any known cellular mRNA. Mouse 
Cd38 siRNA (sc-37246) or rat Cd38 siRNA (sc-270394) is each pool of 3 target- 
specific 19-25-nucleotide siRNAs designed to knock down gene expression. The 
sequences for mouse Cd38 siRNAs are: 5'‘-GUGUACUACCAACAUUCAA-3’, 
5'-GUGUGUCUUUAGUAGGUAU-3’, 5’-CCAGUUUGUGAUUGUUGA-3’. 
Rat Cd38 siRNAs are: sequence 1: 5‘-CUCAAACCAUACCAUGUAA-3’, 
sequence 2: 5‘-GGAAGAGCUUCCCAAUACA-3’, sequence 3: 5/-GUGUUAU 
CGUCUAGCAAUA-3’. 
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siRNA was prepared according to the transfection protocol for cell cultures 
from Santa Cruz Biotechnology. In brief, 1 ml of siRNA transfection reagent 
mixture (transfection reagent, sc-29528, transfection medium, sc-36868) was 
co-incubated with astrocytes for 6h in a 5% CO) incubator at 37 °C, and then 
the same amount of DMEM 20% FBS was added. An additional incubation was 
performed for 18h. 
Statistical analysis. Results were expressed as mean + s.e.m. When only two 
groups were compared, an unpaired t-test was used. Multiple comparisons were 
evaluated by one-way ANOVA followed by Tukey’s test, two-way ANOVA or 
repeated-measure two-way ANOVA followed by Bonferroni test. Correlation was 
evaluated by Spearman’s rank correlation. P < 0.05 was considered statistically 
significant. Statistical analyses were performed using GraphPad Prism 6. 
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Extended Data Figure 1 | Astrocytic mitochondria particle detection. 
a, Electron microscopic analysis demonstrated that mitochondria 

were detected within extracellular astrocyte-derived particles. Free 
mitochondria were also found in ACM. b, In FACS analysis, control beads 
were used to gate populations ranging in size from 500 to 900 nm. ¢, In 
ACM, approximately 53% of particles in the size range were positive for 
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functional mitochondria (n = 2 biological replicates, n =5 independent 
experiments). d, After FACS analysis to isolate the extracellular 
mitochondria fraction from ACM, particle size was measured by qNano 
analysis. Consistent with electron microscopy analysis, a range of size 
distributions was observed (~25%: 300-400 nm, ~75%: 400-1,100 nm). 
Data are mean + s.e.m. 
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Extended Data Figure 2 | Characteristics of astrocytic mitochondrial respectively (n = 2 biological replicates, n = 4 independent experiments). 
particles in FACS analysis. a, Mitochondrial particles were identified cADPR (11M) did not appear to affect these distributions (n = 2 biological 
by FACS. b, Of these mitochondrial particles, FACS analysis showed that replicates, n = 4 independent experiments). Data are mean + s.e.m. 


approximately 79% and 43% of particles express 31-integrin and CD63, 
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independent experiments). b, Intracellular ATP levels in astrocytes were e, cCADPR-induced extracellular ATP levels within extracellular particles 
upregulated by cADPR stimulation (n = 4 independent experiments). were diminished by the intracellular calcium blocker, BAPTA-AM 
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Extended Data Figure 4 | Summary of experiment in Fig. 2c. 

a, We repeated the experiment in Fig. 2c with n = 4 independent 

primary cultures per group. Similar results were obtained (n = 2 
biological replicates, n = 4 independent experiments). The extracellular 
mdACM group was significantly different compared to the ACM group. 
Furthermore, in this repeated experiment, there was also statistical 
significance between controls (OGD-damaged neurons alone) versus 
those treated with mitochondria-containing astrocyte media (ACM), and 
there was no statistically significant worsening when comparing control 
versus mitochondria-depleted groups (mdACM). Taken together, these 
two separate experiments suggest a modest but statistically significant 
neuroprotection induced by astrocyte-derived mitochondria. Data are 
mean + s.e.m. one-way ANOVA followed by Tukey’s test. b, MitoTracker 
Red CMXRos (200 nM) was incubated without astrocytes to obtain 
no-cell-derived media (negative control). Media was collected and further 
incubated with neurons after OGD. After 24h, there was no mitochondrial 
signal observed. Scale bars, 100m. 
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Extended Data Figure 5 | Role of astrocytic CD38 in mitochondria 
transfer during starvation in vitro. a, Immunocytochemistry in 
neuron-astrocyte co-cultures demonstrated that CD38 was primarily 
expressed within astrocytes. b, Extracellular ATP levels were higher in 
media collected from neurons co-cultured with astrocytes compared 

to media from neuron-alone cultures (1 = 3 biological replicates, n =9 

or 11 independent experiments). c, After serum/glucose starvation, 
neurons were significantly damaged, as expected. But neurons co-cultured 
with astrocytes were protected (n = 2 biological replicates, n =6 or 4 
independent experiments). d, CD38 suppression by siRNA significantly 
decreased extracellular ATP levels in neuron-astrocyte co-cultures, but 
CD38 suppression did not affect extracellular ATP levels in neuron-alone 
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cultures (n = 3 biological replicates, n = 9 or 6 independent experiments). 
e, Blockade of astrocytic CD38 by siRNA significantly increased LDH 
release (indicative of cell damage) in the co-cultures, suggesting that CD38 
may be important for maintaining neuroglial homeostasis (1 = 2 biological 
replicates, n = 6 independent experiments). f, Rat primary neurons were 
co-cultured with rat astrocytes. Immunocytochemistry showed that CD38 
suppression by siRNA reduced astrocytic mitochondria (red) transfer into 
neurons compared to control. g, h, Western blot analysis indicated that 
CD38 suppression by siRNA can be successfully performed in astrocyte 
culture without affecting cell viability (n = 2 biological replicates, n= 4 

or 3 independent experiments). Data are mean + s.e.m. P values are from 
an unpaired t-test. 
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neuronal cell death and prevents neurite outgrowth in vitro. transferred dysfunctional mitochondria into neurons (n =3 independent 
a, Astrocytic aconitase was inhibited by fluorocitrate (FC), which experiments). f, Metabolically disrupted astrocytes could not support 
disrupted astrocyte metabolism that was accompanied by a senescence- neural viability under starvation in the co-culture (n = 4 independent 
associated (- galactosidase (SA-3-gal) signal. b, Intracellular ATP was experiments). g, Co-culture between astrocytes and neurons was 
decreased in these metabolically disrupted astrocytes (n = 2 biological conducted for 48 h to test neurite outgrowth. Immunocytochemistry 
replicates, n = 6 independent experiments). *P < 0.05, **P < 0.01 showed that metabolically disrupted astrocytes prevented neurite 
versus 0 mM fluorocitrate. c, Propidium iodide (PI) staining showed outgrowth and increased neuronal cell death (n = 3 independent 
that fluorocitrate (0.5 mM) did not induce cell death in astrocytes. experiments). h, The LDH assay indicated that fluorocitrate (0.5 mM) did 
d, Metabolically disrupted astrocytes significantly decreased the not affect cell viability in either rat cortical astrocytes (n = 4 independent 
mitochondrial membrane potential. Red: aggregated JC1; green: experiments) or rat cortical neurons (n= 4 independent experiments). 
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transferred mitochondria, which had a high-membrane potential 
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Extended Data Figure 7 | FACS analysis using E17 FVB/N-Tg c, Representative image before cell sorting. d, Purity after cell sorting. 
(GFAPGFP) 14Mes/J transgenic mice. a, Cortical neurons were e, The MAP2* GFP~ or MAP2* GFP* populations were 92.5% or 85.9% 
isolated from E17 FVB/N-Tg (GFAPGFP)14Mes/J transgenic mice. positive for DAPI, respectively. f, Western blot analysis demonstrated 
Immunocytochemistry showed that cultured neurons did not express that both GFP-positive and -negative neurons expressed mature neuron 
either GFP or GFAP protein after OGD, suggesting that stroke-like stress marker (neurofilament) but not neuronal stem-cell marker (nestin). These 
may not lead to ‘leakiness’ in this astrocyte-specific GFP mouse. b, Brain data exclude the possibility that GFAP-positive cells included subsets of 


cell suspension was prepared from FVB/N-Tg (GEAPGFP)14Mes/J mice neuronal precursor cells that are known to also express GFAP. 
subjected to transient ischaemia, then FACS analysis was performed. 
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Extended Data Figure 8 | Effects of CD38 suppression by siRNA 

in vivo and in vitro. a, Western blot showed that CD38 expression was 
increased in peri-infarct cortex at days 1 to 7 after stroke. b, Cd38 

siRNA or a scrambled control was injected into lateral ventricles at 5 days 
after stroke. Western blot analysis confirmed that CD38 expression was 
successfully decreased in peri-infarct cortex at 7 days. ¢, In peri-infarct 
cortex, CD8 T-cell and Ibal-positive microglia/macrophages were 
detected by immunohistochemistry. d, Quantification of the number of 
CD8-positive or Ibal-positive cells indicated that there was no difference 
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between control siRNA or Cd38 siRNA (n= 6 mice per group). Data 
are mean +s.e.m. e, Cultured rat cortical astrocytes were subjected to 
OGD for 2h followed by treating with control siRNA or Cd38 siRNA. 
Astrocyte cell morphology or GFAP expression was assessed by 
immunocytochemistry or western blot after 22 h reoxygenation. 

f, Morphology change was not clearly observed in cultured astrocytes 
treated with Cd38 siRNA compared to control siRNA. g, Western 
blot analysis showed that CD38 was successfully decreased by siRNA 
transfection but GFAP expression was not changed. 
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Extended Data Figure 9 | Neuronal purity confirmed by FACS analysis from C57BL/6J mice. These comparisons confirmed that the MAP2* 
in vivo. To confirm our FACS findings, we used two different published population did not contain any appreciable amounts of microglia or 
standard approaches***?. a, By FACS, the MAP2-positive population astrocytes, whereas another neuron marker (NeuN) was highly enriched. 
was gated and further assessed by markers such as Ibal (microglia/ b, Similar findings were obtained using an alternative gating method to 
macrophage) and GFAP (astrocyte) in brain cell samples isolated isolate neurons. 
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Extended Data Figure 10 | Involvement of integrin-mediated Src/Syk 
signalling mechanisms in astrocytic mitochondrial entry into neurons 
in vitro. a, b, Cultured rat cortical astrocytes were stimulated by cADPR 
(1 4M) for 24h. Intracellular mitochondria labelled by MitoTracker dye 
was significantly increased in astrocytes stimulated with cADPR (1 1M) 
(n=2 biological replicates, n = 7 independent experiments). **P < 0.01 
versus 0h. c, Some mitochondria were found outside of cells. d, FACS 
analysis revealed that approximately 5 x 10° mitochondria were 
contained in 1 ml ACM. cADPR (1M) significantly increased the 
number of mitochondria in the media (n = 2 biological replicates, n =6 
independent experiments). e, Experimental schedule to quantify the entry 
of mitochondria into neurons after OGD. Rat cortical neurons (1 x 10° 
cells per well) were prepared in 24-well culture plates. ACM or cADPR- 
ACM (each 1 ml) was co-incubated with neurons for 18h. Mitochondrial 
entry into neurons was calculated by mitochondrial intensity measured 
before and after washing cells with PBS. Phenol-red-free culture media 
was used to decrease the background signal. The background signal was 
subtracted from the fluorescent intensity obtained from each sample. 

f, OGD for 2h decreased approximately 50% of the mitochondria in 
neurons after 18h reoxygenation (n= 2 biological replicates, n = 4 
independent experiments). g, Data are expressed as relative values, with 
total neuronal mitochondria after 2h OGD/18h reoxygenation being 
100%. Mitochondrial entry into neurons was slightly higher after cADPR- 
ACM treatment (18%) compared to ACM treatment (11%), although 
there was no statistical significance (n = 2 biological replicates, n = 4 
independent experiments). h, There was no difference in the percentage 


of mitochondrial entry after ACM treatment or cADPR-ACM treatment 
(n= 2 biological replicates, n = 4 independent experiments). i, cADPR- 
ACM treatment supported neuronal viability better than ACM treatment 
(n=2 biological replicates, n = 4 independent experiments). j, Co-culture 
between rat cortical astrocytes in the upper chamber and rat cortical 
neurons in the lower chamber was performed for 18h after OGD for 

2h in neurons. Mitochondrial entry into neurons was then measured. 

k, Immediately after OGD, dynasore (5}1M), RGDS peptide (H-Arg-Gly- 
Asp-Ser-OH; 501g ml~ 1), or MNS (3,4-methylenedioxy-{-nitrostyrene; 
141M) was initially added to neurons for 30 min, then astrocyte 
co-culture was performed for 18h. Data are expressed as relative values, 
with astrocytic extracellular mitochondria plus entered mitochondria 
into neurons being 100%. RGDS peptide and MNS significantly 
decreased mitochondrial entry into neurons, but dynasore did not 
inhibit the entry. 1, MNS treatment significantly decreased astrocyte- 
mediated neuroprotection (n= 2 biological replicates, n = 4 independent 
experiments). m, Dynasore (5|1M), RGDS peptide (50 jg ml"), or 

MNS (1M) did not affect neuronal viability after 2h OGD (n=4) 

Data are mean +s.e.m. These data suggest that the entry of astrocytic 
mitochondrial particles into neurons may involve integrin-mediated Src/ 
Syk signalling mechanisms. However, we acknowledge that these pathways 
may be multifactorial, and deeper analyses are warranted to dissect entry 
mechanisms under various physiological and pathological conditions. 

P values are from a one-way ANOVA followed by Tukey’s test (a, i, k, 1) 
or an unpaired t-test (d, g). 
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HIV-1 antibody 3BNC117 suppresses viral rebound in 
humans during treatment interruption 
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Interruption of combination antiretroviral therapy in HIV-1- 
infected individuals leads to rapid viral rebound. Here we report the 
results of a phase Ila open label clinical trial evaluating 3BNC117, 
a broad and potent neutralizing antibody against the CD4 binding 
site of the HIV-1 Env protein', during analytical treatment 
interruption in 13 HIV-1-infected individuals. Participants 
with 3BNC117-sensitive virus outgrowth cultures were enrolled. 
Results show that two or four 30 mgkg~! 3BNC117 infusions, 
separated by 3 or 2 weeks, respectively, are generally well tolerated. 
Infusions are associated with a delay in viral rebound of 5-9 weeks 
after two infusions, and up to 19 weeks after four infusions, or 
an average of 6.7 and 9.9 weeks, respectively, compared with 
2.6 weeks for historical controls (P < 0.00001). Rebound viruses 
arise predominantly from a single provirus. In most individuals, 
emerging viruses show increased resistance, indicating escape. 
However, 30% of participants remained suppressed until antibody 
concentrations waned below 201g ml“ !, and the viruses emerging 
in all but one of these individuals showed no apparent resistance to 
3BCN117, suggesting failure to escape over a period of 9-19 weeks. 
We conclude that the administration of 3BNC117 exerts strong 
selective pressure on HIV-1 emerging from latent reservoirs during 
analytical treatment interruption in humans. 

A fraction of HIV-1-infected individuals develops broad and 
potent serologic activity against the virus. Single-cell antibody cloning 
methods? have uncovered the source of this activity as broadly 
neutralizing antibodies (bNAbs), which target different sites on the 
HIV-1 envelope spike protein, gp160 (refs 1-3). 

In animal models, bNAbs show potent prophylactic activity, suppress 
established viraemia, and delay viral rebound during analytical 
treatment interruption (ATI)**®. In humans, a phase I clinical trial 
showed that 3BNC117 is generally safe and effective in transiently 
reducing viraemia in chronically HIV-1-infected individuals’. A 
single infusion of 3BNC117 was well tolerated, rapidly decreased viral 
loads in viraemic individuals by an average of 1.48 logio copies per ml, 
with durable activity for 4 weeks”. In addition, 3BNC117 increased 
autologous antibody responses in HIV-1-infected individuals, and 
enhanced clearance of infected cells in humans and in humanized 
mice!!! VRCO1, a less potent bNAb that also targets the CD4-binding 
site, suppressed viraemia by 1.14 logo (refs 12, 13 and Fig. 1a, b). 

To investigate whether 3BNC117 can suppress viral rebound from the 
latent reservoir during ATI in chronically suppressed HIV-1 infected 


1,14 


humans, we conducted a phase IIa open label clinical trial. To select 
participants with 3BNC117-sensitive viruses in their latent reservoirs, 
we performed bulk viral outgrowth cultures of peripheral blood 
mononuclear cells (PBMCs) from individuals whose viraemia was 
suppressed by combination antiretroviral therapy (ART). The resulting 
isolates were screened for sensitivity to 3BNC117 using the TZM-bl 
assay (Supplementary Table 1). Of 63 individuals screened, only 11% 
yielded viruses that were fully resistant to 3BNC117 (ICs9 > 20 1g/ml), 
and 65% were sensitive to 3BNC117 ICs9 at concentrations below 
2.0 jg/ml. In contrast only 29% were similarly sensitive to VRCO1 
(Fig. 1a and b, Extended Data Fig. 1 and Supplementary Table 1). 

We enrolled HIV-1 infected individuals who were on suppressive 
antiretroviral therapy (ART) with plasma viral loads <50 HIV-1 RNA 
copies per ml for at least 12 months, had CD4 counts >500 cells per mm, 
yielded 3BNC117-sensitive outgrowth viruses (ICs < 2.0 ug ml”}), 
and whose viral load at screen was <20 copies per ml (Extended Data 
Fig. 1, Supplementary Tables 2-4, and Methods). Participants were 
enrolled in two groups: eight in group A to receive two 30 mgkg"! 
infusions three weeks apart, while seven in group B received up to four 
30mgkg™! infusions at two-week intervals (Fig. 1c, d, Supplementary 
Table 2). Two group A participants had viral loads >20 copies per 
ml at the time of infusion and were excluded from further analysis 
(Supplementary Tables 2 and 4). Participants are numbered 701-715 
(Supplementary Table 2). 

ATI was started 2 days after the first 3B NC117 infusion. ART was 
reinitiated and infusions were stopped after two consecutive plasma 
viral load measurements exceeded 200 copies per ml. All individuals 
on non-nucleoside reverse transcriptase inhibitors (NNRTIs) were 
switched to an integrase-inhibitor-based regimen (dolutegravir plus 
tenoforvir disoproxil fumarate/emtricitabine) four weeks before ATI 
owing to the long half-life of NNRTIs (Supplementary Table 2). 

Both dosing regimens were generally well tolerated. The majority 
of reported adverse events were transient and grade 1 in severity 
(Supplementary Table 5). The mean CD4 T-cell count at baseline (day 
0) was 747 cells per mm’, and the average change in CD4 T-cell counts 
between start of ATI and rebound was —127 cells per mm?. Although 
CD4 T cells declined modestly during viral rebound in some partici- 
pants, CD4 T-cells returned to baseline by week 12 in most participants 
(mean 828 cells per mm?) (Extended Data Fig. 2 and Supplementary 
Table 4). Of 12 individuals tested, 5 showed measurable increases in 
the magnitude and/or breadth of T cell responses to HIV-1 12 weeks 
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Figure 1 | 3BNC117 neutralization coverage, 
trial design and pharmacokinetics of 3BNC117 
in HIV-1-infected individuals during ATI. 

a, b, Sensitivity of virus outgrowth cultures from 
63 ART suppressed individuals to 3BNC117 and 
VRCO1 (Supplementary Table 1). The y-axis 
shows the fraction of viral outgrowth culture 
supernatants neutralized by a given antibody 
concentration (x-axis) in Tzm-bl assays. Red 
line indicates cut-off IC59 (2g ml~') for 
participation in the trial. c, d, Diagrammatic 
representation of study groups A and B 
respectively. 3BNC117 infusions indicated by the 
red arrows, and sampling for PK and virologic 
studies indicated below. Numbers indicate study 
weeks. e, f, 3BNC117 levels as determined by 
ELISA for group A (n=6, left panel, red), group 
B (n=7, right panel, red (n= 4), black (n=2) 
and purple (n =1)), HIV-1 negative (n = 3, blue) 
and viraemic individuals (n= 6, green). Curves 
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indicate mean 3BNC117 levels, error bars indicate 
standard deviation. Arrows indicate 3BNC117 
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after ATI, relative to baseline (Extended Data Fig. 3). None of the par- 
ticipants experienced acute retroviral syndrome during rebound, and 
viraemia was re-suppressed below 20 copies per ml in all participants 
within 2-7 weeks after restarting ART (Supplementary Table 4). We 
conclude that up to four 30 mgkg™! infusions of 3BNC117 during ATI 
are generally safe and well tolerated. 

By anti-idiotype ELISA? the half-life of 3BNC117 during ATI was 
19.6 days among group A participants, and 14.1 days among those 
in group B (Fig. le, f and Supplementary Table 4). These measure- 
ments are similar to previously reported values for 3BNC117 in HIV-1- 
infected individuals on ART? (Fig. le). 

All six group A participants maintained viral loads below 200 
copies per ml during the first 4 weeks, with rebound 5-9 weeks after 
ART interruption (Fig. 2a and Supplementary Table 4a). In group B, 
rebound occurred 3-19 weeks after ATI, with four out of seven (57%) 
participants remaining suppressed for at least 10 weeks (Fig. 2b and 
Supplementary Table 4b). The average time to rebound was 6.7 weeks 
in group A, 9.9 weeks in group B, and 8.4 weeks for all participants 
together, compared with 2.6 weeks for matched historical non-infused 
control individuals (Fig. 2c, Extended Data Fig. 4a, Supplementary 
Tables 4, 6 and 7). Altogether, 6 of the 13 infused individuals (46%) 
remained suppressed until at least 9 weeks after ATI. Relative to 
matched historical control individuals, the delay to rebound among 
all 3BNC117-infused participants was highly significant (P < 0.00001 
weighted log-rank test, Fig. 2c, Extended Data Fig. 4, Supplementary 
Tables 4, 6 and 7 and Methods). We conclude that repeated infusions 
of 3BNC117 are generally safe and significantly delay HIV-1 rebound 
from the latent reservoir during ATI. 

Time to viral rebound did not correlate with pre-ATI viral culture 
sensitivity to 3BNC117, nor to baseline levels of cell-associated HIV-1 
DNA (Fig. 2d and Extended Data Fig. 4e). Therefore the signif- 
icance of viral outgrowth sensitivity as an inclusion criterion is not 
clear. 3BNC117 levels at rebound were also variable, ranging from 
6-168 gm’, but directly correlated with the ICgo of the emerging 
virus (Fig. 2a, b, e). 

To determine whether rebound was associated with resistance to 
3BNC117, we compared pre-infusion and rebound viral outgrowth 
cultures. A majority (8/13) of participants had rebound viruses that 
were more resistant to 3BNC117 (ICgp > threefold higher, Fig. 3a, c, 
Extended Data Fig. 5a, Supplementary Table 3). Among group A 


participants, all but one (707) had more resistant rebound viruses; 
however, among group B participants, four of seven (710, 711,712 and 
715) showed similar pre-infusion and rebound sensitivity to 3BNC117 
(Fig. 3a, c, Extended Data Fig. 5a, Supplementary Table 3). Among 
these five individuals, 711 was the earliest to rebound at 3 weeks, despite 
having viruses that were surprisingly sensitive to 3BNC117 as measured 
by ICso (Fig. 2b, Extended Data Fig. 5a, Supplementary Tables 3 and 4). 
However, 100% neutralization was not achieved against 711 rebound or 
pre-infusion viruses, even at high (501g ml7') antibody concentrations, 
suggesting that 3BNC117 was not fully therapeutic (Extended Data 
Fig. 5a, Supplementary Table 3). Thus, the only participant in the study 
to rebound within 3 weeks of ATI may have done so because of pre- 
existing resistance to 3BNC117 by the dominant virus in the reservoir. 

The other four participants that showed no change between pre- and 
post-infusion culture sensitivity to 3B NC117, 707, 710, 712, and 715 
rebounded relatively late at 9, 19, 16 and 11 weeks after ATI, respectively 
(Figs 2a, b, 3a, c, Extended Data Fig. 5, Supplementary Tables 3 and 4). In 
all of these individuals rebound was associated with relatively low anti- 
body concentrations ranging from 6-41 j1gml7! (mean 19.7 1g mI“). 
This antibody concentration represents 9.6-fold the mean ICgo for the 
rebounding viruses, which is consistent with previous reports on the 
relationship between suppressive 3BNC117 concentration and neutrali- 
zation titre in macaques" (Fig. 2, Extended Data Fig. 5, Supplementary 
Tables 3 and 4). 

To determine whether viral rebound during ATI was associated with 
resistance to other bNAbs undergoing clinical testing, we examined 
sensitivity to 10-1074 (ref. 15), which targets a different and non- 
overlapping epitope on the HIV-1 trimer (Fig. 3b, d, Extended Data 
Fig. 5, and Supplementary Table 3). With the exception of 703 and 711, 
the participants’ rebound cultures did not show increased resistance 
to 10-1074. We conclude that rebound during ATI in the presence 
of 3BNC117 is infrequently associated with increased resistance to 
10-1074. 

To characterize viruses emerging from the latent reservoir further, 
we performed single genome sequencing (SGS) of viral RNA from 
the plasma and viral outgrowth cultures from eight individuals. 
Phylogenetic analysis of these sequences indicated that all of these eight 
trial participants were infected with epidemiologically unrelated viruses 
(Extended Data Fig. 6). Given the limited sampling of the pre-infusion 
reservoir, rebound viruses did not always fall within the radiation 
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Figure 2 | Delay in viral rebound in the presence of 3BNC117. 
a, b, Plasma viral loads and 3BNC117 levels in group A and group B 
participants respectively. 3B NC117 infusions are indicated with red 
arrows. The left y-axis shows plasma viral loads in RNA copies per ml 
(black curves), and right y-axis shows antibody levels measured by ELISA 
(red curves). Average rebound time point (2.6 weeks, Supplementary 
Table 6) in 52 ACTG trial participants who underwent ATI without 
antibody treatment”? is shown with dotted lines. Grey areas indicate ART 
therapy. c, Kaplan-Meier plot summarizing viral rebound in 52 ACTG 
trial participants who underwent ATI without antibody treatment (black, 
Supplementary Table 6), and the combination of all 13 participants (red) 
who underwent ATI with 3BNC117 infusions. The y-axis indicates the 
percentage of participants with viral levels below 200 RNA copies per ml, 
x-axis indicates weeks after ATI initiation. The P value is based ona 


of pre-infusion viral isolates (Fig. 3e, f, Extended Data Figs 7 and 8, 
Supplementary Figs 1 and 2). 

Remarkably, in five of eight participants, all rebounding virus 
sequences clustered within a low diversity lineage, consistent with the 
clonal expansion of a single recrudescent virus (Fig. 3e, f, Extended 
Data Figs 7 and 8, Supplementary Table 8). These data contrast with 
individuals undergoing ATI in the absence of antibody infusion, where 
virus rebound is consistently polyclonal, indicating the activation 
of multiple latently infected cells!*!°. Thus, in addition to delaying 
rebound, 3BNC117 appears to restrict the outgrowth of viral genotypes 
from the latent reservoir. 

Six of eight participants sequenced had rebound viral outgrowth 
culture and/or plasma sequences that indicated 3BNC117 resistance. 
For example, in 704, all rebound viruses carried a serine at position 
456 (Fig. 4a and Supplementary Figs 1 and 2), which may disrupt a 
highly conserved salt bridge that maintains the V5 loop’s position 
and conformation”””. Similarly, in 708 and 709, nearly all rebound 
viruses carried atypical residues at position 282, where a lysine residue 
typically forms a salt bridge with 3BNC117 (Fig. 4a and Supplementary 
Figs 1 and 2)?3. However, documented 3BNC117 resistance mutations”+ 
were not universally identified among rebound viral strains (Fig. 4a 
and Supplementary Figs 1 and 2). Only a minor fraction (3 of 23) of 
sequences in the rebound population of participant 701 had potential 
resistance-conferring residues in Loop D (274F, 282R), while the 
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bootstrap version of the weighted log-rank test adjusting for the potential 
confounders ‘years on ART, and ‘age’ (Supplementary Table 7, Methods 
Statistical Analyses). d, Dot plot indicating the relationship between 
3BNC117 sensitivity of pre-infusion outgrowth cultures at screening 
(y-axis, ICgo in pg ml~!) and the week of rebound (x-axis). Group A (n= 6) 
and group B (n= 7) participants are coloured red and blue respectively. 
The P value was derived from calculating the Pearson correlation 
coefficients. e, Dot plot indicating the relationship between 3BNC117 
sensitivity of rebound outgrowth cultures (y-axis, ICgo in jug ml~’) 

and the 3BNC117 serum concentration at rebound (x-axis, in j1g ml~!). 
704, 708, 709 and 713 did not reach ICgp at the concentrations tested and 
were assigned a value of 22 1g ml~!. Group A (1 =6) and group B (n=7) 
participants are coloured red and blue, respectively. The P value was 
derived from calculating the Pearson correlation coefficients. 


remaining sequences did not (Fig. 4a and Supplementary Figs 1 and 2). 
Similarly, in 702 and 703, only a subset of rebound viruses carried a 
putative resistance-conferring A281D change’. Nevertheless, the 
frequency of this change increased markedly with time in both par- 
ticipants, indicating continued selection for 3BNC117 resistance (Fig. 
4a and Supplementary Figs 1 and 2). For participants 707 and 711, 
no sequence features were identified that would indicate 3BNC117 
resistance. 

To determine the sensitivity of rebound viruses to 3BNC117, 
we performed TZM-bI neutralization assays using pseudoviruses 
typed with SGS Env genotypes (Fig. 4b, Extended Data Figs 7 and 8, 
Supplementary Figs 1 and 2 and Supplementary Table 9). With the 
exception of participant 707, who rebounded 9 weeks after ATI at very 
low 3BNCI117 titres, Env genotypes at rebound were more resistant 
to 3BNC117 than pre-ATI (Fig. 4b, Extended Data Figs 7 and 8 and 
Supplementary Table 9). We conclude that viral rebound during ATI 
in the presence of 3BNC117 selects for the emergence of resistant 
variants, indicating strong selection pressure by this antibody on viral 
populations arising from the reservoir. 

Antibody potency and half-life are directly correlated with HIV-1 
prophylaxis in pre-clinical models. For example, VRCO1, a CD4bs anti- 
body that is less potent than 3BNC1 171, is less effective than 3BNC117 
in preventing SHIVapg infection in macaques*”>. Consistent with these 
observations, clinical trials with combinations of three less-potent 
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Figure 3 | Viral rebound during 
ATI and 3BNC117 treatment. 
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and purple bars represent ICgp titres 


8 s 
RG 


701 702 703 704 707 708 709 710 711 712 713 714 715 701 


* 
20 
10 


702 703 704 707 708 709 710 711 712 713 714 715 


against rebound outgrowth cultures 
* from the indicated time points. 
Asterisks indicate cultures failing 
to reach an ICgp up to 20g m1. 


** ke 


<% 
7 

= & @ | 
<a 


c, d, Difference between rebound 
I and pre-infusion culture ICgo titres 
(from a, b) for 3BNC117 (c) or 


20 Lt 
LE LEE LE GF LE _ LP H Wh L 
Vinal 
AS ATES Pr KS SE ee ROO ae we ee 
e 
701 704 


< < 


707 


LE LE _L LE O&O LC OP 
Ak hs L. 
BES SSE GSS 


cyvy 


a 


e 
600005, 00 geeeeess socceee loees [ee 


709 


a 


60055050000, ,0,000°° * 
eo! 
id st. 0 
e[ eee eceee [eee 
ee 
oereeeeegegee 


OF OS 10-1074 (d). For cultures failing to 
AEE ROE Aw reach an ICgp up to 20ppgml-!, a 

value of 20 jg ml“! was assigned, 
and such cultures are marked 


2 
SPS GS 


708 


as with an asterisk. e, f, Clonality 


of the rebound virus. Maximum 

e likelihood phylogenetic trees 

e comparing pre-ATI single 

‘ genome derived env sequences 
(blue and orange) to rebound 
plasma env sequences (green and 
pink) are shown for participants 
whose rebound comprised single 
(e) versus multiple (f) viruses 
(Supplementary Table 8). Pre-ATI 
culture sequences were inferred as 
described in the Methods section 
‘Statistical analyses. 


ggeeeeenge' 
e e 


eqeesce*o 
e e 


Screening culture virus 
Day 0 culture virus 

Time point 1 plasma SGS 
Time point 2 plasma SGS 


0.005 


AN S 
RS Sor NAS 


-@ Pre-infusion 
~® Rebound 

(major genotype) 
— Rebound 

(minor genotypes) 


a Loop D Loop D b 
ve MGFDINT |e HGMTHNTL 
We Q { wo | S 
nM = ST 58 
702 - ie 2 
vo IDV jr RIOT 
703) w5 kt 704) wW5 
we ) we 


SANS 0 
sere INKS 


Figure 4 | 3BNC117 resistance in rebound viruses. a, Logogram shows 
env gp120 regions (amino acid positions; 270-285 and 455-467, according 
to HXBc2 numbering) indicating sequence changes from pre-infusion 
culture(s) (first row) to rebound sequences derived from plasma SGS at 
the indicated time points. The frequency of each amino acid is indicated 
by its height. Red residues represent mutations predicted to affect 
neutralization*”. b, 3BNC117 neutralization sensitivity of pseudoviruses 
derived from pre-infusion or rebound SGS. Black lines represent 
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pre-infusion virus envs; red lines represent the major env at rebound 

for each participant (Extended Data Figs 7 and 8, Supplementary 

Tables 8 and 9 and Methods); grey lines represent minor rebound envs in 
participants with multiple rebound viruses or variants that evolved after 
rebound (Extended Data Figs 7 and 8, Supplementary Table 9). Symbols 
reflect the means of two technical replicates; error bars denote standard 
deviation. 
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first-generation bNAbs showed limited effects on viral rebound in 
the setting of ATI in chronically infected individuals”*’’, In addition, 
selective pressure as evidenced by escape mutations was only observed 
for one of the three antibodies used in the combination, 2G12?°?”. In 
contrast, 3BNC117 alone significantly delayed viral rebound with nearly 
half of all individuals remaining below 200 copies per ml until at least 
9 weeks, including four individuals who failed to develop resistance and 
only rebounded at low antibody concentrations. We speculate that the 
difference in efficacy between 3BNC117 and less potent bNAbs in the 
setting of ATI is due to increased potency and/or a longer half-life’, 

Nevertheless, the majority of the individuals we studied rebounded at 
high 3BNC117 serum concentrations. A single viral genotype display- 
ing increased resistance to 3BNC117 established rebound in most cases. 
These viruses represent pre-existing dormant variants that emerged 
from the latent reservoir. The time to rebound did not correlate with 
the amount of viral DNA in circulating PBMCs; however, this is a poor 
measure of the HIV-1 reservoir, since most integrated proviruses in 
patients on ART are defective”*. Instead, the delay in viral rebound may 
represent a measure of the frequency of 3BNC117-resistant variants in 
the latent reservoir. 

Combinations of drugs are needed to maintain HIV-1 suppression 
in effective ART regimens. Similarly, combinations of antibodies were 
required to suppress viraemia in humanized mice’. We speculate that 
combinations of bNAbs will also be needed to increase the frequency of 
individuals that remain suppressed by antibody during ATI. 

Whether 3BNC117 can also impact the size and composition of the 
latent reservoir during ATI will require additional studies. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Study design. An open-label, dose-escalation phase 2a study was conducted in 
HIV-1-infected participants (http://www.clinicaltrials.gov; NCT02446847). Study 
participants were enrolled sequentially according to eligibility criteria. Group A 
received 3BNC117 on days 0 and 21 at a dose of 30 mg/kg body weight at a rate 
of 250 ml/hour. Group B received 3BNC117 on days 0, 14, 28 and 42 at a dose of 
30 mg/kg, as long as viral rebound did not occur. Antiretroviral therapy (ART) was 
discontinued 2 days after the first 3B NC117 infusion (day 2). Plasma HIV-1 RNA 
levels were monitored weekly, and ART was resumed when viral load increased 
to >200 c.p.m. in two consecutive weekly measurements. 

Study participants were followed for 36 weeks after the first infusion. Safety 
data are reported until week 36 for participants enrolled in group A and until 
week 14 for participants enrolled in group B. All participants provided written 
informed consent before participation in the study and the study was conducted 
in accordance with Good Clinical Practice. The protocol was approved by the 
Federal Drug Administration in the USA and the Institutional Review Board at 
the Rockefeller University. 

Study participants. All study participants were recruited at the Rockefeller 
University Hospital, New York, USA. Eligible participants were adults aged 
18-65 years, HIV-1-infected and before enrolment had plasma HIV-1 RNA levels 
<50 c.p.m. for at least 12 months while on combination ART and <20 c.p.m. at the 
screening visit, and current CD4 count >500/j1L. In addition, participant-derived 
HIV-1 isolates produced by co-culture of participant PBMCs with HIV-uninfected 
donor PBMCs were required to be neutralized by 3BNC117 with an ICs» <2 g/ml 
in TZM-bI neutralization assays, as previously described*!. An ICso of <2 ug/ml 
was chosen as a cut-off based on previous PK data of 3BNC117 in humans? 
and data in macaques showing that antibody levels 10-100 times the ICso value 
against infecting viral strains are necessary to control viral rebound'. However, 
given the limited diversity and representation of the latent reservoir in outgrowth 
cultures (Supplementary Fig. 2) and the fact that no correlation between pre- 
infusion IC5o and delay of viral rebound was found in this study, the significance 
of this criterion is unclear. Participants on an NNRTI-based ART regimen were 
switched to a study-provided integrase-inhibitor-based regimen (dolutegravir 
(Tivicay, ViiV Pharmaceuticals) + tenofovir disoproxil fumarate/emtricitabine 
(Truvada, Gilead Sciences) 4 weeks before treatment interruption due to the 
prolonged half-life of NNRTIs. Exclusion criteria included history of CD4 nadir 
<200 cells/jul, concomitant hepatitis B or C infections, previous receipt of a mon- 
oclonal antibody of any kind, or clinically relevant physical findings, medical 
conditions or laboratory abnormalities. Pregnant and breastfeeding women were 
not eligible. 

Historical controls (ACTG trial participants). Viral rebound data from 
52 participants who participated in four ACTG ATI studies without additional 
interventions (ACTG 371°”, A5024**, A5068*4, and A5197°7) were compared 
with viral rebound data in this study. Historical controls were selected based on 
similar inclusion criteria: age 18-65, Plasma HIV-1 RNA <50 c.p.m. for at least 
12 months before ATI while on combination ART, CD4 count at time of ATI 
>500 cells/jil, CD4 nadir >200 cells/l, weekly viral load measurements at least 
until viral rebound occurred. 

Study procedures. The appropriate volume of 3BNC117 was calculated according 
to body weight, diluted in sterile normal saline to a total volume of 250 ml, and 
administered intravenously over 60 min. Study participants received 3BNC117 
on days 0 and 21, or 0, 14, 28, and 42 and remained under monitoring at the 
Rockefeller University Hospital for 4h after each infusion. Participants returned 
for frequent follow up visits for safety assessments, which included physical 
examination, measurement of clinical laboratory parameters such as haematology, 
chemistries, urinalysis, and pregnancy tests (for women). Plasma HIV-1 RNA levels 
were monitored weekly during the ATI period and CD4 counts were measured 
every other week (Supplementary Table 4). Study investigators evaluated and 
graded adverse events according to the DAIDS AE Grading Table and determined 
causality. Blood samples were collected before and at multiple times after 3BNC117 
infusions. Samples were processed within 4h of collection, and serum and 
plasma samples were stored at —80°C. PBMCs were isolated by density gradient 
centrifugation. The absolute number of peripheral blood mononuclear cells was 
determined by an automated cell counter (Vi-Cell XR; Beckman Coulter), and cells 
were cryopreserved in fetal bovine serum plus 10% DMSO. 

ART re-initiation criteria. Antiretroviral therapy was discontinued 2 days after 
the first 3BNC117 infusion (day 2). ART was re-initiated when HIV-1 RNA levels 
were found to be >200 c.p.m. and/or CD4 T cell counts decreased to <350 cells/j1l 
and the result was confirmed with a repeat measurement. 
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Plasma HIV-1 RNA Levels. HIV-1 RNA levels in plasma were measured at the 
time of screening (within 49 days before the first infusion), day 0 (before infusion), 
and weekly until week 12, then at weeks 14, 24 and 36. Participants that remained 
virologically suppressed to <20 c.p.m. off ART beyond week 12, returned for 
weekly measurements of plasma HIV-1 RNA levels. HIV-1 RNA levels were 
determined using the Roche COBAS AmpliPrep/COBAS TaqMan HIV-1 Assay, 
Version 2.0, which detects between 20 and 1 x 10” c.p.m. This assay was performed 
at LabCorp. 

CD4* and CD8* T cells. CD4* and CD8* T-cell counts were determined at 
screening, on day 0 (before infusion), and weeks 2, 3, 4, 6, 8, 10, 12, 14, and 36 
by a clinical flow cytometry assay, performed at LabCorp. Cells were analysed by 
flow cytometry. Leukocytes were determined as CD45* cells. Percentage of cells 
positively stained for CD3, CD4, CD8 as well as the CD4/CD8 ratio were analysed 
with the BD Multiset software (BD Biosciences). 

3BNC117 study drug. 3BNC117 is a recombinant, fully human IgG1« mAb 
recognizing the CD4 binding site on the HIV-1 envelope’. The antibody was cloned 
from an HIV-1-infected viraemic controller in the International HIV Controller 
Study’, expressed in Chinese hamster ovary cells (clone 5D5-5C10), and purified 
using standard methods. The 3BNC117 drug substance was produced at Celldex 
Therapeutics Fall River (MA) GMP facility, and the drug product was fill-finished 
at Gallus BioPharmaceuticals (NJ). The resulting purified 3BNC117 was supplied 
as a single use sterile 20 mg/ml solution for intravenous injection in 8.06 mM 
sodium phosphate, 1.47 mM potassium phosphate, 136.9mM sodium chloride, 
2.68 mM potassium chloride, and 0.01% polysorbate 80. 3BNC117 vials were 
shipped and stored at 4°C. 

Measurement of 3BNC117 serum levels. Serum levels of 3BNC117 were 
determined by a validated sandwich ELISA at Celldex Theapeutics as described 
previously’. Plates (Sigma-Aldrich PN: CLS3590 96-well, High Bind, polystyrene) 
were coated with 4\1g/ml of an anti-idiotypic antibody specifically recognizing 
3BNC117 (anti-ID 1F1 mAb), and incubated overnight at 2-8 °C. After washing, 
plates were blocked for 1h with 5% BSA. Serum samples, QCs and standards 
were added (1:50 minimum dilution in 5% BSA) and incubated for 1h at room 
temperature. 3BNC117 was detected using an HRP-conjugated mouse anti- 
human IgG kappa chain specific antibody (Abcam PN: ab79115) and the HRP 
substrate tetra-methylbenzidine. 3BNC117 concentrations were then interpolated 
from a standard curve of 3BNC117 using a four-parameter logistic curve-fitting 
algorithm. The reference standard and positive controls were created from the drug 
product lot of 3BNC117 used in the clinical study. 

Pharmacokinetic analysis. Blood samples were collected immediately before and 
at the end of infusions as well as on the day after infusion, weekly during the ATI 
period and at weeks 14, 24 and 36. 3BNC117 serum levels were measured by ELISA 
(Celldex Therapeutics). 

Neutralization assay. Serum samples, viral supernatants, and control antibodies 
were tested against HIV-1 envelope pseudoviruses as previously described*™”. 
Cell-associated HIV-1 DNA. Participant’s CD4* T-cells were isolated from 
10 million cryopreserved PBMCs by negative magnetic selection (Miltenyi). Total 
DNA was extracted and quantitative PCR performed using pol- and CCR5-directed 
primers as previously described’. 

Virus cultures. Autologous virus was retrieved from HIV-1 infected individuals 
as previously described*!. Briefly, healthy donor PBMCs were obtained by 
leukapheresis from a single donor. Cells were cultured at a concentration of 
5 x 10°/ml in Iscove’s Modified Dulbecco’s Medium (IMDM; Gibco) supplemented 
with 10% fetal bovine serum (FBS; HyClone, Thermo Scientific), 1% penicillin/ 
streptomycin (Gibco), and 1 j1g/ml phytohemagglutinin (PHA; Life Technologies) 
at 37 °C and 5% CO . After 2-3 days, 5 x 10° CD8* depleted cells were transferred 
into IMDM supplemented with 10% FBS, 1% penicillin/streptomycin, 5 1g/ml 
polybrene (Sigma), and 100 U/ml of IL-2. Cells were then co-incubated with 
4-8 x 10° CD4* T cells from the study participants and 10 million irradiated 
healthy donor PBMCs that had been cultured together for 24 h prior in IMDM 
supplemented with 10% FBS, 1% penicillin/streptomycin, 100 U/ml IL-2 and 
1j1g/ml PHA at 37°C and 5% CO». Lymphoblasts were replenished weekly by 
adding 3 million healthy donor PHA stimulated CD8* depleted lymphoblasts. 
Culture supernatants were quantified using the Alliance HIV-1 p24 Antigen 
ELISA kit (PerkinElmer) according to the manufacturer’s instructions. TCIDsos 
were determined for all HIV-1 containing supernatants*®*’ and then tested for 
sensitivity against 3BNC117 and other bNAbs in a TZM.bI neutralization assay. 
Blood samples and leukapheresis were collected under separate IRB-approved 
protocols and after volunteers provided informed consent. 

Sequence analysis. HIV-1 RNA extraction and single genome amplification 
was performed as described previously**. In detail, HIV-1 RNA was extracted 
from plasma samples using the Qiagen MinElute Virus Spin kit (Qiagen) 
followed by first strand cDNA synthesis using SuperScript III reverse 
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transcriptase (Invitrogen Life Technologies) and the antisense primer env3out 
5/-TTGCTACTTGTGATTGCTCCATGT-3’. gp160 env was amplified using 
envB5out 5’-TAGAGCCCTGGAAGCATCCAGGAAG-3/ and envB3out 
5!-TTGCTACTTGTGATTGCTCCATGT-3’ in the first round and second round 
nested primers envB5in 5’-CACCT TAGGCATCTCCTATGGCAGGAAGAAG-3/ 
and envB3in 5’- GICTCGAGATACTGCTCCCACCC-3’. PCRs were performed 
using a High Fidelity Platinum Taq (Invitrogen) at 94°C, 2 min; (94°C, 15 s; 55°C 
30 s; 68°C, 4min) x 35; 68°C, 15 min. Second round PCR was performed with 1 jl 
of first PCR product as template and High Fidelity Platinum Taq at 94°C, 2 min; 
(94°C, 15s; 55°C 30s; 68°C, 4min) x 35; 68°C, 15 min. Sequence alignments, 
phylogenetic trees and mutation analysis of gp160 was performed by using 
Geneious Pro software, version 8.1.6 (Biomatters Ltd.)*°. Sequence analysis was 
performed using Antibody database by A. West’. Logograms were generated using 
the Weblogo 3.0 tool””. 

Pseudovirus generation. Selected SGS from virus culture supernatants and 
plasma were used to generate pseudoviruses and tested for sensitivity to bNAbs 
ina TZM.bl assay*". To produce the pseudoviruses, plasmid DNA containing the 
cytomegalovirus (CMV) promoter was amplified by PCR using forward primer 
5/-GTTGACATTGATTATTGACTAG and reverse primer 5’-CTTCCTGCCAT 
AGGAGATGCCTAAAGCTCTGCTTATATAGAC-CTC. The CMV promoter 
amplicon was fused to individual env SGS amplicons by PCR using forward 
primer 5’-AGTAATCAATTACGGGGTCATTAGTTCAT and reverse primer 
5!-ACTTTTTGACCACTTGCCACCCAT. Fusion PCR was carried out using the 
Expand Long Template PCR System (Roche) in a 6011 reaction consisting of 1 
ng purified CMV promoter amplicon, 0.125 1l unpurified env SGA amplicon, 
200 nM forward and reverse primers, 200|1M dNTP mix, 1x Buffer 1, and 1 jl 
DNA polymerase mix. PCR was run at 94°C for 2 min; 25 cycles (94°C for 12s, 
55°C for 30s, 68°C for four minutes); and 72°C for 10 min. Resulting ampli- 
cons were analysed by gel electrophoresis, purified without gel extraction, and 
co-transfected with pSG3Aenv into HEK293T cells to produce pseudoviruses as 
described previously. 

Statistical analyses. Adverse events were summarized by the number of 
participants who experienced the event, by severity grade according to the 
DAIDS AE Grading Table and by relationship to 3BNC117 as determined by the 
investigator. PK-parameters were estimated by performing a non-compartmental 
analysis (NCA) using WinNonlin 6.3. Kaplan-Meier survival curves were used 
to compare time to rebound in trial participants to participants in previous ATI 
studies conducted by ACTG”. To exclude the possibility that the observed delay 
in rebound is confounded by clinical factors, we compared the clinical variables 
between the control (ACTG trial participants) and treated group using a two- 
sided Fisher's Exact test for categorical variables (gender and CD4 Nadir) and an 
unpaired Wilcoxon test (two-sided) for continuous variables (age, years on ART 
and CD4 count before ATI initiation) (Supplementary Table 7). Additionally, we 
tested for each potential confounder whether the variable is predictive for the 
rebound time. Therefore, we built a univariate survival regression model for 
each potential confounder and compared those models to a null model using 
a likelihood ratio test (LRT), which determines how much better the more 
complex model explains the data than the less complex model. Confounders 
were considered significant if the model with the potential confounder had an 
LRT P value of 0.05 or less, which was the case for ‘years on ART” as well as ‘age’ 
for the comparison between the controls and the combined treatment group 
(Supplementary Table 7). We did not perform a standard Cox regression, since 
the proportional hazards assumption was not fulfilled for some of the variables. 
Rebound time was modelled using a log-normal distribution, which resulted in 
the best model fit as measured by Akaike information criterion (AIC) among 
several different distributions (Extended Data Fig. 4b-d). To determine the effect 
of the treatment after adjusting for the discovered confounders, we performed a 
weighted log-rank test*”. Therefore, for each sample inverse probability weights 
based on the discovered confounders were estimated, which were used to re-weigh 
the variables of the log-rank statistic. We performed a bootstrapped version of the 
weighted log-rank test, as recommended in ref. 42 owing to the small sample size. 
We estimated the class probabilities using a lasso logistic regression model trained 
with the Matlab function lassoglm with five lambda values in a threefold cross- 
validation. To improve stability, the optimal lambda for the lasso logistic regression 
was determined only once using the original labels and used in all bootstrap runs 
to train the models that estimate the class probabilities. 

Additionally, we performed an LRT at significance level a= 0.05 based ona 
parametric survival regression model adjusted for the discovered confounders. In 
this analysis the treatment group still significantly predicted the delay in rebound 
(Supplementary Table 7). For the analyses, the R (version 3.2.1) packages survival 
(version 2.38-3) and fitdistrplus (version 1.0-6) were used and Matlab (version 
R2015b) for implementation of the weighted log-rank test. 


Sequence and phylogenetic analysis. Nucleotide alignments were generated 
using ClustalW (v.2.11)*? and manually adjusted using Geneious R8 (v.8.1.6)°? 
and MacClade (v.4.08a)“*. Sites that could not be unambiguously aligned were 
removed for all phylogenetic analyses. Optimal evolutionary model classes were 
determined using jModelTest (v.2.1.4)*°. Maximum likelihood phylogenetic 
trees were generated using PhyML (v.3)*° with joint estimation of evolutionary 
model parameter values and phylogenies. The tree comparing all participants was 
midpoint rooted and each within-subject tree was rooted on the basal branch as 
determined by the between-subject tree. Sequences with premature stop codons 
and frameshift mutations that fell in the gp120 surface glycoprotein region were 
excluded from all deduced protein analyses. 

Sequences generated from the supernatants of viral outgrowth assays 
represented viruses that were present in the latent reservoir. Per assay, 4-8 million 
CD4* T cells were activated. In an HIV-infected person who is completely 
suppressed on antiretroviral therapy, it has been determined that 1 x 10° resting 
CD4 cells are latently infected with replication-competent virus’”. Thus, one would 
expect to identify up to eight distinct viral isolates per individual culture. Single 
genome sequencing of the culture supernatants revealed sets of clonally related 
sequences, which appear as ‘rakes’ in a phylogenetic tree (Extended Data Figs 7 
and 8). The most recent common ancestor of these rakes represents the reactivated 
virus that was present in the host (similar to the inference of infectious molecular 
clones as described in ref. 48). As shown in Extended Data Figs 7 and 8, sequences 
from culture reactions fall in 1-3 rakes within a given individual. We inferred 
each rake’s most recent common ancestor (MRCA) by building a majority-rule 
consensus and treated it as a single virus from the participant's latent pool. These 
MRCAs were used to build the phylogenetic trees shown in Fig. 3. 

Because mixed culture isolates replicated for 14 or more days, in vitro recombi- 
nants were observed. In vitro recombinants from culture reactions were identified 
and removed from the data set if they: (i) had two identifiable parental sequences 
within the same culture reaction; and (ii) exhibited three consecutive informative 
sites relative to one parent followed by three consecutive informative sites relative 
to another. We independently verified that a subset of these sequences showed 
evidence of recombination using the Recco tool (v.0.93)”. 

Assessment of rebound virus clonality. The Poisson Fitter v2 tool” is designed to 
test if set of homogeneous sequences exhibits random diversification. If such a set 
of sequences exhibits a star-like phylogeny and a Poisson distribution of pairwise 
differences (Hamming distances), it can be inferred that a single virus gave rise to 
those sequences. Poisson Fitter v2 tests these and other parameters using maximum 
likelihood methods and performs a x” goodness of fit test to obtain a P value. A non- 
significant P value signifies that the observed Hamming distances adhere to a Poisson 
distribution and it can be inferred that a single virus gave rise to rebound. Single 
genome derived env sequences from the plasma at the earliest time point post-rebound 
from each participant were tested using Poisson Fitter (Supplementary Table 8). 
ELISPOT T-cell response analysis. Interferon gamma Elispot assays were 
performed as described*!. Briefly, 96-well polyvinylidene plates (Millipore, 
Bedford, Mass.) were precoated with 0.5 g/ml of anti-IFN- monoclonal antibody, 
1-DIK (Mabtech, Stockholm, Sweden) and previously frozen PBMCs were plated 
at a concentration of 50,000 to 100,000 cells per well in a volume of 100,11 of R10 
medium (RPMI 1640 (Sigma), 10% fetal calf serum (Sigma), 10 mM HEPES buffer 
(Sigma)) with antibiotics (2 mM L-glutamine, 50 U of penicillin-streptomycin/ml). 
Plates were incubated overnight at 37 °C, 5% CO , and developed as described*!. 
Cells were tested against a panel of 410 B-clade overlapping 18-mer peptides (OLPs) 
spanning the entire HIV-1 genome (consensus sequence from 2001). These peptides 
were used in a matrix system of 11-12 peptides per pool to screen study participants 
for HIV-specific T cell responses. Confirmation of recognized individual peptides 
within a peptide pool was undertaken in an additional Elispot assay, as described. 
Wells containing PBMCs and R10 medium alone were used as negative controls and 
were run in duplicate on each plate. Wells containing PBMCs and phytohemaggluti- 
nin (PHA) served as positive controls. The numbers of spots per well were counted 
using an automated Elispot plate reader (ImmunoSpot Reader System; Cellular 
Technology Limited, Shaker Heights, OH). Responses were regarded as positive if 
they had at least three times the mean number of spot forming cells (SFC) in the two 
negative control wells and had to be >50 SFC/ 10° PBMCs (ref. 51,52). 
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indicate the time of viral rebound. For study participants 710, 712 and 715 
rebound occurred at week 19, 16 and 11, respectively. Baseline samples for 
study participant 710 and week 12 samples for study participant 714 were 
not available for ELISpot analysis. Overall, breadth, magnitude and protein 
specificity were heterogeneous among the study participants. 
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Extended Data Figure 4 | Viral rebound in ACTG control subjects 
and trial participants. a, Kaplan-Meier plot summarizing viral rebound 
in 52 ACTG trial participants who underwent ATI without antibody 
treatment (black, Supplementary Table 6) and trial participants 

(Fig. 2a, b, Supplementary Table 4). Six group A participants are shown 
in red, seven Group B participants in blue and the combination in 

green as indicated. The y-axis indicates the percentage of participants 
with viral levels below 200 RNA copies per ml, x-axis indicates weeks 
after ATI initiation. The survival curves of all considered partitions 

of the trial participants (group A, group B and group A + B) differed 
significantly at significance level a = 0.05 from the survival curve of the 
ACTG trial participants. For the comparison of group A (group A + B) 
with the ACTG trial participants, we performed a weighted log-rank test 
adjusting for the clinical variables ‘years on ART” and ‘age’ to correct for 
possible confounding factors (Supplementary Table 7, P < 0.00001). We 
identified those potential confounders by univariate parametric survival 
regression using a likelihood ratio test (Statistical Methods). Since we did 


not discover any confounders with the same analysis among all available 
clinical variables for the comparison between group B participants and 

the ACTG trial participants, we performed a standard log-rank test 

in that setting (P < 0.0001). b-d, In order to perform a survival regression, 
the distribution of the rebound times has to be determined. Therefore, 

we compared the empirical cumulative distribution function (CDF) of the 
rebound times (black, solid line) with the CDF of the rebound times to a 
fitted distribution (Weibull, exponential, normal, logistic, log-normal, and 
gamma) for each comparison group (combined trial participants, group 

A or group B with ACTG control patients). Since the Akaike information 
criterion (AIC) and the Bayesian information criterion (BIC) were smallest 
for the log-normal distribution (green), we have chosen to model the 
rebound times with the log-normal distribution. e, Dot plot indicating 

the relationship between cell associated HIV DNA in pre-infusion PBMCs 
(y-axis) and the week of rebound (x-axis). Group A and group B 
participants are coloured red and blue respectively. The P value was 
derived from calculating the Pearson correlation coefficient. 
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Extended Data Figure 5 | In vitro neutralization of pre-infusion and 
rebound virus outgrowth cultures by 3BNC117 or 10-1074. a, b, 
TZM-bl assay neutralization by 3BNC117 (a) and 10-1074 (b) are shown 
for individual virus outgrowth cultures derived from pre-infusion (black 
lines/symbols) or rebound (red lines/symbols) time points for each 
participant. In some cases, multiple independent cultures were grown 
from a single time point and assayed for neutralization (Supplementary 
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Table 3). ‘Screen’ refers to cultures of PBMC samples taken weeks before 
infusion during screening, while ‘Day 0’ refers to cultures of PBMCs 
collected immediately before the first 3B NC117 infusion. Rebound culture 
time points are denoted by the week (W) at which the samples were 
collected. Symbols reflect the means of two technical replicates; error bars 
denote standard deviation. 
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Extended Data Figure 6 | Phylogenetic tree of env nucleotide sequences 
from trial participants. A maximum likelihood phylogenetic tree was 
constructed from single-genome-derived viral env sequences from 
outgrowth culture supernatants as well as plasma from participants 

701 (olive), 702 (black), 703 (pink), 704 (yellow), 707 (light blue), 

708 (green), 709 (dark blue) and 711 (brown). Hypervariable (as defined 


704 


708 


709 


703 


701 


702 


707 


in http://www.hiv.lanl.gov/content/sequence/VAR_REG_CHAR/) and 
other poorly aligned regions were excluded from the analysis. The tree 
was constructed using PhyML with a GIR+1+G substitution model and 
midpoint rooted. Asterisks indicate 100% bootstrap support (only values 
for major nodes are shown). The scale bar indicates 0.01 substitutions 
per site. 
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Extended Data Figure 7 | Rebound virus clonality and neutralization 
sensitivity to 3BNC117. Maximum likelihood phylogenetic trees of 
plasma and culture-derived env sequences are shown for participants 
701, 702, 703, 704. Sequences obtained at screening, on Day 0, and 
consecutive rebound time points (plasma and cultures) are colour coded 
as indicated. The trees were rooted based on the branch insertion 
identified in the between-subject tree (Extended Data Fig. 6). Bootstrap 
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values >90% are shown. Names of env sequences used to generate 
pseudoviruses for 3BNC117 neutralization analysis are indicated along 
with the respective ICgo titres in jg ml~!. Representative rebound viruses 
selected in Fig. 4b are marked with red stars (Fig. 4b, Supplementary 
Table 9). Zero branch length viruses in multi-rebounders 702 and 703 
are marked with black stars. 
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sensitivity to 3BNC117. Maximum likelihood phylogenetic trees of 
plasma- and culture-derived env sequences are shown for participants 
707, 708, 709 and 711. Sequences obtained at screening, on day 0, and 
consecutive rebound time points (plasma and cultures) are colour coded 
as indicated. The trees were rooted based on the branch insertion 
identified in the between-subject tree (Extended Data Fig. 6). Bootstrap 
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Mitochondrial and nuclear DNA matching shapes 
metabolism and healthy ageing 


Ana Latorre-Pellicer’*, Raquel Moreno-Loshuertos*, Ana Victoria Lechuga-Vieco!*, Fatima Sanchez-Cabo!, Carlos Torroja!, 
Rebeca Acin-Pérez!, Enrique Calvo!, Esther Aix!, Andrés Gonzdlez-Guerra', Angela Logan®, Maria Luisa Bernad-Miana®, 

Eduardo Romanos*, Raquel Cruz’, Sara Cogliati!, Beatriz Sobrino’, Angel Carracedo””", Acisclo Pérez-Martos®, 

Patricio Fernandez-Silva’, Jestis Ruiz-Cabello!*°, Michael P. Murphy’, Ignacio Flores!, Jestis Vazquez! & José Antonio Enriquez!* 


Human mitochondrial DNA (mtDNA) shows extensive within- 
population sequence variability'. Many studies suggest that 
mtDNA variants may be associated with ageing or diseases”*, 
although mechanistic evidence at the molecular level is 
lacking®®. Mitochondrial replacement has the potential to 
prevent transmission of disease-causing oocyte mtDNA. 
However, extension of this technology requires a comprehensive 
understanding of the physiological relevance of mtDNA sequence 
variability and its match with the nuclear-encoded mitochondrial 
genes. Studies in conplastic animals’~° allow comparison of 
individuals with the same nuclear genome but different mtDNA 
variants, and have provided both supporting and refuting evidence 
that mtDNA variation influences organismal physiology. However, 
most of these studies did not confirm the conplastic status, focused 
on younger animals, and did not investigate the full range of 
physiological and phenotypic variability likely to be influenced 
by mitochondria. Here we systematically characterized conplastic 
mice throughout their lifespan using transcriptomic, proteomic, 
metabolomic, biochemical, physiological and phenotyping 
studies. We show that mtDNA haplotype profoundly influences 
mitochondrial proteostasis and reactive oxygen species generation, 
insulin signalling, obesity, and ageing parameters including 
telomere shortening and mitochondrial dysfunction, resulting 
in profound differences in health longevity between conplastic 
strains. 

The mtDNAs of C57BL/6 and NZB/OlaHsd mice differ by 12 missense 
mutations, 4 transfer RNA (tRNA) mutations, 8 ribosomal RNA 
(rRNA) mutations, and 10 non-coding-region mutations (Extended 
Data Fig. 1b, Extended Data Table 1 and Supplementary Data 1), a 
level of divergence comparable to that between human Eurasian and 
African mtDNAs. We developed a conplastic mouse strain (Extended 
Data Fig. 1a) with the C57BL/6 nuclear genome and the NZB/OlaHsd 
mtDNA, hereafter referred to as BL/6N2®. The original C57BL/6 strain 
with the C57BL/6 nuclear genome and C57BL/6 mitochondrial genome 
is hereafter referred to as BL/6“”. Nuclear genome purity was con- 
firmed (Extended Data Fig. 1c—e). Conplastic animals, BL/6°’ and 
BL/6N25, were fertile, and on a chow diet there were no weight dif- 
ferences throughout the growth period (Extended Data Fig. 2a, b), 
with marginal differences in activity observed in metabolic chambers 
(Extended Data Fig. 2c-e). 

Unexpectedly BL/6N® extended median lifespan by 16% without 
modifying maximum lifespan (Fig. 1a). After 9 months, BL/6N® 
mice gained less weight (Fig. 1b). Two-year-old BL/6“” animals 
manifested more signs of ageing’? than age-matched BL/6“® mice 
(Fig. 1c, Extended Data Fig. 3a, b); including ovary preservation, with 


less lipofuscin accumulation at 12 months in BL/6N78 mice (Extended 
Data Fig. 3e, f). Telomeres in one-year-old animals were on average 
approximately 11% longer in BL/6“*’ mice, whereas in two-year-old 
animals this was inverted, with BL/6‘® telomeres slightly longer 
(Fig. 1d). This is due to a BL/6°°” telomere-length-reduction rate 
that is double than that of BL/6N“® mice (Fig. le). Liver necropsy 
revealed a higher incidence of tumours in BL/6“” mice at death 
(Fig. 1f and Extended Data Fig. 3c, d), suggesting greater genome 
instability. 

Four days after birth, respiration and ATP synthesis were lower in 
liver mitochondria from BL/6‘8 mice. Both variables subsequently 
equalized, but from 100-days-old onwards, the rates in BL/ 6°” liver 
mitochondria decreased progressively (Fig. 1g). These changes were 
due to a steady decline in BL/6” respiration, with BL/6N”® respiration 
remaining constant between days 20 and 300 (Fig. 1h). Liver mtDNA 
sequencing revealed no age-related increase in mt DNA mutations in 
either mtDNA backgrounds (Extended Data Table 2, Supplementary 
Datal). Later metabolic deterioration, lower tumour incidence, 
slower telomere attrition!!, and sustained ovarian function indicate 
that the NZB mtDNA promotes healthier ageing in the BL/6 nuclear 
background. 

Gene expression signatures of 12-week-old mice (Fig. 2a and 
Supplementary Data 2) revealed differences in molecular functions 
related to ‘lipid and carbohydrate metabolism in liver and heart and 
‘free radical scavenging’ specifically in liver (Fig. 2b). Consistent with 
increased PPARa activity, BL/6N“® mice had higher expression of 
lipid metabolism and lower expression of carbohydrate metabolism 
and inflammation pathways (Extended Data Fig. 4 and Supplementary 
Data 2). 

Plasma and liver metabolomics of 12-week-old mice confirm the 
mtDNA haplotype effect in three main groups: glutathione metabo- 
lism; digestion and absorption; and lipid and membrane metabolism 
(Fig. 2c, Extended Data Fig. 5 and Supplementary Data 3). Metabolite 
set enrichment analysis (MSEA) of glutathione metabolism revealed 
enhanced glutathione turnover’’ and reactive oxygen species (ROS) 
defence in BL/6N“8 animals (Fig. 2d and Extended Data Fig. 5b). In 
parallel, we detected lower levels of the hepatic lipid peroxidation 
product 12,13-hydroxyoctadec-9(Z)-enoate. Other affected metab- 
olites included a number of fatty acids and lysophospholipids, con- 
sistent with the differences in membrane metabolism (Extended Data 
Fig. 5d). 

Quantitative proteomics revealed a differential profile for the ROS 
defence proteome (Fig. 2e, f and Supplementary Data 4). In liver, 
the major mitochondria-located redox regulators Glrx2, Glrx5 and 
Txnrd2 were increased in young and old BL/6N”® mice, whereas 
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la salud, Zaragoza 50009, Spain. ’Fundacién Publica Gallega Medicina Xendmica, SERGAS, Complexo Hospitalario Universitario de Santiago, 15706 Santiago de Compostela, Spain. 8Center of 
Excellence in Genomic Medicine Research, King Abdulaziz University, 21589 Jeddah, Saudi Arabia. 2Universidad Complutense de Madrid, Madrid 28606, Spain. 


28 JULY 2016 | VOL 535 | NATURE | 561 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a b 407 c d fl 1-year-old i 2-year-old 
100 tee Hity re ee 
754 —~ BL/6°7 35 4 +9 + °e a at 
3g > BL/eNze S ese thes, + $50 >=101_9 | g 
2 § 304 / c] 3 > 
5 504 2 sg - ole 3 
5 = i 8 2 
° e C57 o 5 e 
5 254 25 4 BUG D> E 
a -o BL/6NZ8 @ 
2 
0 20 : 1 1 1 
0 200 400 600 800 1,000 1,200 0 200 400 600 800 BL/ee? — BL/eNzB 150 200 250 300 
Age (days) Age (days) Telomere length (a.u.) 
e f HWtumour [No tumour g -@ Pyr+mal -®- Glu + mal ~®- Succinate h 
20 0.54 nmol O, min“ per mg protein 
- 255 300 ® BL/67 glu + mal 
= § ® BL/6°7 pyr + mal 
§ 204 15 + _ 2) 
=e ia a 
8 8 2 200 8 
B 154 * g a = 
s 3 10 2 a 
e 104 2 = L 
2 ce) ‘E 1004 
2 a a 
e 54 : s g ® BL/6"28 glu + mal 
Ss 100 fe} 
o 2 — @ BL/6N28 pyr + mal 
cal os 4 S: 0 
0- 0 ; 4 21 65 115280 680 1 10 100 
BL/6°57— BL/6NZ8 BL/6?s’ —s BL/6N28 Age (days) Age (weeks) 
Figure 1 | Conplastic animals differ in metabolism and ageing profiles. rank sum test). e, Telomere length reduction during second year of life 
a, Survival curves (1 = 31 per genotype (19 males, 12 females); Gehan- (mean +s.e.m. *P < 0.05, two-tailed t-test). f, Tumour incidence in 
Breslow-Wilcoxon test *P < 0.05). b, Weight gain (chow diet, n = 19 males animals dying from natural causes (n = 14 for BL/ 6N28 n= 20 for BL/6“’; 
per genotype, mean +s.e.m., *P < 0.05, two-tailed t-test). c, Mice that were **P<0.01, Fisher's exact test). g, Relative O02 consumption (coupled) 
~2 years old were scored for ageing parameters (n = 5 for BL/6N”8, n=4 and ATP synthesis rates in liver mitochondria (n = 4 per genotype and 
for BL/6™”). For more information on ageing parameters see Extended age; mean + s.d.). h, O2 consumption (coupled) respiration in liver 
Data Fig. 3a. a.u., arbitrary units. d, Telomere length in hair follicle cells mitochondria simultaneously isolated from 22-, 30-, 120- and 300-day-old 
of 1- (n=6 per group) and 2-year-old mice (n=5 for BL/6N28, n=4 males (n= 8). 
for BL/6°°”). Red lines indicate mean length (***P < 0.001, Wilcoxon's 
a b Liver Liver and heart c 
— I 1 
aw 10 ° cs Liver BL/6°*” Liver BL/6"28 Plasma BL/6°°’ Plasma BL/6N28 
‘ Liver BL/6NZ® H.BL/6"28 vs BL/6C57 Liver and plasma (197) Liver (313) Plasma (287) 
3 05 } 7 
) : Heart BL/6°5” L.BL/6%28 ys BL/6C57 : 10 i 10 : 
rig an Poe ' fap 
D = ae 5 Seta oo 1 H 5 ‘ 
8 0.0 S@EMSESPOSCSS ELL ORESERSES GEE Die wf 5 : 
2 eodeeeecaggigecslegsasgaeie? EX prey gf tt ol 
is Heart BL/6N?® See eee ese PRS RES SESE AT ESSER SES Ss ' 7 j an ae 
Q -05 SORTEDPES TP SVBSSSVSLESER TOE xe e eles i 
8 _-Liver BL/6®®7 S Esmbasegveteegesosr assesses - ae Sy 4 
at . S CSPSSSSSES Oey SxX~ HVE HES ‘ i ‘ : 1) -10 ia 
-1.0 RB pt GOest lo sss CSS esEltaeg 5 0 5 5 Oo 5 5 0 5 
ate Ee aases ahr eoaESst 7 @ “ 
— . . : . : 8 E Zo fogs ° g3s oPe F X-variate 1 X-variate 1 X-variate 1 
6 42 02 46 ie z 28a 5 3885 3 (23, 9, 14) (23, 9, 14) (24, 18, 6) 
SB BG 
Leading logFC dim 1 & 3° 
d e mtDNA: Age: f mtDNA: Age: 
Zq NZB -Zq C57 Zq Old - Zq Young Zq NZB -Zq C57 Zq Old - Zq Young 
log,,(fold change} % On 
94 0( ge) ye SSS 2% > 
ia Gatt aL 
0.15 1 0.15 C57 NZB -— c Prdx3 Ggtt 
LP Biochemical name Txnrd2 C57 NZB mac, 
5-oxoproline FDR MSA Sod2 = | Txnraa 
[| _|Glutamate ; = -5.0 1 5.0 -5.0 1 5.0 Gss 
[i Glutathione, oxidized Liver Plasma . : 
ae PI ger 
alu e (I 
t glatimlicucire 0,008 0.14 |__| Prdx4 
y-glutamyltyrosine Gpx3 
'y-glutamylmethionine Sod3. 
y-glutamylglutamine _— Girx3. 
'y-glutamylphenylalanine Cat 
y-glutamylisoleucine — | Nnt 
'y-glutamyltryptophan (oo) Girx5 
y-glutamylalanine | | _|Gclm 
Glycine -| Gsr 
Prater butyrate | I Txn 
-hydroxybutyr 
3 Barnincbuayeta r| Prdx2 
Ophthalmate Bi Txnrd1 
n Sod? 
[(Cysteine-glutathione disulfide L| PrdxS 
|S-lactoylglutathione Lr] Prdx6 
Pag tenone Oxidative stress Se 3 
-hode + 9-hode i. 
12, 18-hydroxyoctadec-9(z)-enoate biomarkers rf Glrx5 Gpx4 
Allantoin aa Txn2 Txn2 


Figure 2 | Transcriptomic, metabolomics and proteomics adaptations 
to mtDNA haplotype. a, Multidimensional scaling (MDS) representation 
of RNA-seq analysis (10-weeks-old, n = 3 pools per genotype, 4 mice 

per pool). b, Heatmap and hierarchical clustering of enrichment (—logio 
Benjamini—-Hochberg (BH)-adjusted P values) of the functions defined 
by the ingenuity pathway analysis (IPA) in set of genes differentially 
expressed between conplastic mice in heart (H) (top) and liver (L) 
(bottom). Yellow, non-significant (BH P= 1); red, highly significant 

(BH P < 10‘). c, Partial least squares regression discriminant analysis 
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(PLS-DA) plot for plasma and liver metabolites (10-weeks-old, n = 6 

per genotype). d, MSEA of all detected liver (L) and plasma (P) GSH 
metabolites regardless of relevance to PLS-DA (red text denotes PLS-DA 
relevant), and of selected oxidative stress metabolic biomarkers. 

e, f, Heat maps and hierarchical clustering of enrichment (Zq differences 
between indicated genotypes and/or ages) of the proteins quantified by 
iTRAQ involved in ROS metabolism in either liver (e) or heart (f). See 
Supplementary Data 4 for identity of different proteins and Zq values. 
Purple text denotes mitochondrial location (n = 3 per genotype and age). 
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cytosolic gamma-glutamyltranspeptidase 1 (involved in glutathione 
pool maintenance) and four mitochondrial redox regulators (Prdx3, 
Txnrd2, Sod2 and Nnt) were decreased in young BL/6N8 mice but then 
strongly increased on ageing (Fig. 2e). Ageing has profound but different 
effects on the ROS handling proteomes of the two conplastic mouse lines 
(Fig. 2e). 

Measurement of mitochondrial ROS in vivo using MitoB' revealed 
elevated ROS levels in young BL/6N“® mice in both liver and heart 
(Fig. 3a). As shown in cultured cells“, high levels of mitochondrial ROS 
correlated with mtDNA copy number (Fig. 3b). Chronic treatment of 
mice with N-acetyl cysteine (NAC) reduced the BL/6N”8 MitoP:MitoB 
ratio (Fig. 3a); equalized mtDNA copy number (Fig. 3b); and reduced 
respiration and ATP synthesis rates in isolated liver mitochondria 
(Fig. 3b). The higher rate of mitochondrial ROS in young BL/6N7® 
animals (Fig. 3a) seems to conflict with their superior functional 
preservation with age. However, mitochondrial ROS production was 
inverted in animals above 30 weeks of age, with BL/6\* liver mito- 
chondria from middle-aged mice producing similar levels of ROS to 
young animals, whereas BL/6°°” mitochondria showed a sharp, age- 
related increase in ROS (Fig. 3c). 

There were no dramatic differences in mitochondrial complex or 
supercomplex assembly (Extended Data Fig. 6a), and both strains 
showed the expected loss of supercomplex CIII:CIV due to the mutant 
form of SCAF1 characteristic of the BL/6 nuclear background!®. 
However, BL/6N8 mice of all ages showed a slightly but consistently 
lower level of CI:CIII superassembly (Extended Data Fig. 6a, b). 

Quantitative proteomics confirmed consistent behaviour pat- 
terns and between-genotype differences for structural proteins of all 
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respiratory complexes, independently of whether they are encoded by 
nuclear or mtDNA genes (Extended Data Fig. 6c). In agreement with 
the differences in respiration (Fig. 3b), CI was less abundant in liver 
of young BL/6N78 mice (Extended Data Fig. 6c). In parallel with the 
age-related decline in respiration in BL/6°” mice (Fig. 1h), the amounts 
of respiratory complexes in liver decrease in BL/6©” but not in BL/6N“8 
mice (Extended Data Fig. 6c). An increase in the proportion of free CIII 
compared to that superassembled with Cl is associated with adapta- 
tion to a fuel switch from glucose to fatty acids’®. Consistent with this, 
the proteomic analysis revealed liver content changes in glycolytic and 
B-oxidization enzymes (Extended Data Fig. 6d). 

Despite the influence of mtDNA haplotype on glucose and fatty acid 
metabolism, young mice showed no inter-strain differences in plasma 
glucose or in weight gain. In contrast, in one-year-old mice, the fasting- 
induced reduction in plasma glucose was significantly lower in BL/6°” 
mice (Fig. 3d). This difference correlated with a lower capacity of 
BL/6°*” mice to regulate insulin levels (Fig. 3e) and to clear plasma 
glucose (Fig. 3f). Furthermore, on a high-fat diet (HFD), young BL/6©*” 
mice showed greater increases in body weight and white adipocyte 
size (Extended Data Fig. 7b, c). BL/6“°” mice showed an age-related 
increase in blood cholesterol, whereas levels in BL/6\“® remained 
steady (Extended Data Fig. 7a) 

The mtDNA haplotype influence in lipid and glucose metab- 
olism was confirmed by in vivo magnetic resonance spectroscopy 
(MRS) analysis of lipid content in liver, heart and skeletal muscle. 
Conplastic animals showed no differences in lipid content at 20 weeks 
(Fig. 3g); however, cardiac fibres of BL/6N2® mice had a signifi- 
cantly higher content and size of lipid droplets, suggesting a more 
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Figure 3 | Functional differences among conplastic strains. a, Relative 
amount of HO, + NAC. b, Mitochondrial performance + NAC. Liver: 
relative mtDNA amount; oxygen-dependent (coupled) respiration; 
pyruvate + malate-driven ATP production in isolated mitochondria. 
Heart: relative mtDNA amount. (12-week-old animals; n = 6 per 
genotype). c, Relative liver mitochondrial HO. generation (126 days 

old, n= 2 per genotype; 210 days old, n= 6 per genotype). d, Blood 
glucose after the indicated period of starvation (14 months old, n=7 per 
genotype). e, The homeostasis model for insulin resistance (HOMA-IR) in 


Time after glucose (min) 


50 100 


4.7 p.p.m. 


overnight-starved mice (14 months old, n= 12 per genotype). 

f, Glucose tolerance test in starved mice (14 months old, n =7 per 
genotype) g, Longitudinal quantification of lipid content by 'H MRS 

for the indicated tissue, age and genotype (number of individuals in 
brackets). h, Representative heart TEM images illustrating differences in 
mitochondria and lipid droplets between 20-week-old conplastic mice 
(n=3 per genotype). i, Quantification of lipid content by ‘H MRS in 
skeletal muscle (soleus) of 80-week-old animals (n = 4 per genotype). 
Data are means +s.d., *P < 0.05, two-tailed t-test. 


28 JULY 2016 | VOL 535 | NATURE | 563 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a Liver b Heart 
mtDNA: Age: mtDNA: Age: 
Zq BL/G®’ - ZqBL/6’® _Zq 2y -Zq 12w Zq BL/6’ - ZqBL/6™® Zq 2y - Zq 12w 
5B 8 BY 
é 3 a | 8 8 
i > No- § § +5 TS = > Non ££ & 
Protease - ™ detected M Protease - “ detected m m 
a Xpnpep3 io Xpnpep3 
= Pitrm1 = 
Bx pmpea Px 
© Sos Park7 o Ss 
= 4.9 Pmpcb € ra) = 
= gF& CLpp tel 2 
© S Lonp1 © Ss 
3 Inne 3 Inner 
2 membrane | membrane 
g je pe] a 
- = 
SZ] £e S| 68 
S| s& S| ES 
S Ea S go 
£ a 2 = 
# < = 
= = = Outer 
membrane 
Outer 
membrane Unknown 
Unknown Prss35 * Immp1l/2I; Omat; Parl; Spg7 
c d 
mtUPR mtUPR 
Mitochondrial Mitochondrial 
dynamics dynamics 


12-week-old BL/6°5”  12-week-old BL/6\28 


80-week-old BL/6°5” 


80-week- 


ah I 


5 um 


g Area (pixels?) Aspect ratio Lipid number Lipid size (pixels) 
NS  #ee* ay wee NS pod 
tek | se 
0.15 NS wke« 1:5 ra 100 0.06: ar 
SF '— a 
0.10 1.04. 0.04 
0.05 0.54: 0.02 
0.00 0.0 -—+ 0 T 0 z = 
MitoQ - + - + = ep eno ae a 2 


- + + 
200 274 186 168 


BL/6°57 BL/eNZB 


n: 200 274 186 168 


BL/6°57 BL/6NZ8 
Figure 4 | Proteostasis, mtUPR and dynamics. See also Supplementary 
Data 4. a, b, Proteomic analysis of the mitodegradome. c, d, Detected 
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BL/6°*” mice aged 12 weeks (w) or 2 years (y) (mtDNA) or between old 
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active mobilization of lipids to the heart (Fig. 3h). The MRS analysis 
revealed that mature BL/6N7® adults (40 weeks) had higher tissue 
lipid content despite their lower body weight (Fig. 3g). At 80 weeks, 
lipid content in both strains was similar in liver and heart but was 
significantly higher in BL/6©°” muscle (Fig. 3g, i). This may reflect 
the accelerated deterioration of older BL/6°” mice, with replacement 
of muscle mass by fat. 

The mitochondrial degradome (mtDg) is involved in protein syn- 
thesis regulation, quality control, biogenesis, dynamics, mitophagy, 
and apoptosis!”. Quantitative proteomics revealed between-strain 
differences in the mitochondrial degradome, especially in the heart, 
that were strongly influenced by age (Fig. 4a, b). Matrix proteases 
and chaperones involved in the mitochondrial unfolding response 
(mtUPR)!® were elevated in BL/6N“® animals, suggesting more active 
proteostasis (Fig. 4c, d). This was especially prominent in the liver, 
where elevation in stress chaperones was accompanied by elevation of 
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and 80-week-old mice (2 animals per genotype). f, Representative TEM 
of cardiomyocytes from 12-week-old animals treated with MitoQ (n =3 
per genotype). g, Morphometric analysis of mitochondria in TEM 
images of left ventricular cardiomyocytes obtained from 3 animals per 
group. Data are given as means + s.e.m., non-parametric Kruskal-Walls 
test (not normal distribution), *P < 0.05; **P< 0.01; ***P< 0.001; 
**** P< 0.00001; NS, not significant. 


proteases characteristic of the matrix (LonP1 and Clpp) and implicated 
in mtUPR (Fig. 4a, c). 

The conplastic strains showed differences in critical components of 
the apparatus involved in mitochondrial dynamics (Fig. 4c, d). Heart 
transmission electron microscopy (TEM) revealed between-strain 
differences in mitochondrial aspect and dynamics, affecting density, 
morphology and cristae area (Fig. 4e-g, Extended Data Fig. 8); with more 
mitochondrial fragmentation in hearts of old BL/ 6~*” mice (Fig. 4e). 
When young mice were treated with the mitochondria-specific anti- 
oxidant MitoQ”, heart mitochondria were less rounded in BL/6N“8 
mice (Fig. 4f) and had a decreased mitochondrial area (Fig. 4g). 
MitoQ did not affect lipid droplet-accumulation between BL/6°*” and 
BL/6N8 cardiomyocytes, but increased droplet size in BL/6“” animals 
(Fig. 4g). 

By highlighting the influence of mtDNA and nuclear (nDNA) inter- 
play, our results demonstrate that substitution with a different wild-type 
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mtDNA variant is sufficient to promote differences in mitochondrial 
function and a cellular adaptive response. This phenomenon is orches- 
trated by the complex network of mitochondrial stress response path- 
ways in which proteostasis, mtUPR and ROS signalling lead to adaptive 
responses that impact on the organism's metabolic performance and 
ageing. The potential for non-pathological mtDNA variants to shape 
organismal metabolism has implications for the emerging new field 
of mitochondrial replacement therapy in human oocytes”””!. Thus, 
consideration should be given to the use of donor mtDNA that is a 
close genetic match to the original mtDNA of the acceptor oocyte”. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. The 
experiments were not randomized. The investigators were not blinded to allocation 
during experiments and outcome assessment. 

Generation of conplastic mice and genotyping methodologies. Parental 
C57BL/6JOlaHsd and NZB/OlaHsd strains were purchased from Harlan 
Laboratories. The C57BL/6JOlaHsd strain was derived in 1974 from the C57BL/6J 
strain from Jackson Laboratories but differs from the parental strain. The 
C57BL/6JOlaHsd strain does not harbour the nicotinamide nucleotide transhy- 
drogenase (NNT) spontaneous mutation that renders the encoded enzyme unde- 
tectable, and which is characteristic of the C57BL/6J strain provided by Jackson 
Laboratories. To avoid confusion, we call C57BL/6JOlaHsd mice C57BL/6 through- 
out the manuscript. Conplastic strains were obtained by backcrossing females 
(mitochondrial donors) with males of the parental recipient strain over 20 gener- 
ations to create a new line harbouring the nuclear genome of one strain and the 
mtDNA of the other. Conplastic strains were maintained by backcrossing females 
(mitochondrial donors) of the conplastic strain with males of the parental recip- 
ient strain. Offspring after the tenth generation of backcrossing were considered 
conplastic. Total DNA extracted from tissue homogenates was genotyped by PCR 
restriction-fragment length polymorphism analysis (RFLP) and by Affymetrix 
Mouse Diversity Genotyping Array, which interrogates 623,000 single nucleotide 
polymorphisms (SNPs) covering the entire mouse genome at a resolution of 4.3 kb. 
mtDNA next-generation sequencing. Liver samples were taken from 
12- (n=5 per genotype), 40- (n=4 per genotype) and 80-week-old (n=5 
per genotype) conplastic mice. Total DNA was extracted with the DNeasy 
Blood and Tissue kit (Qiagen). mtDNA was amplified in two independ- 
ent reactions using primers 7309Fw (CATAGGGCACCAATGATACTG) 
— 16265Rv (GAGTTTTGGTTCACGGAACAT) and 15358Fw (AAGAA 
GAAGGAGCTACTCCCC) — 8257Rv (CAGCTCATAGTGGAATGGCTA). 
Long-range PCRs were carried out using Takara Taq DNA Polymerase (Clontech). 
Products were cleaned using the QiAquick PCR Purification kit (Qiagen), quan- 
tified with the Qubit dsDNA BR Assay kit (Invitrogen) and mixed in equimolar 
amounts. mtDNA was then enzymatically sheared using the Ion Shear Plus Reagent 
(Ion Torrent) following the manufacturer’s instructions. To yield fragments with 
a median size of 200 bp, the shearing reaction was adjusted to 8 min at 37°C, and 
fragments were size selected using Agencourt AMPure XP Reagent (Beckman 
Coulter). Both ends of each fragment were ligated with Ion P1 Adapters and the 
Ion Xpress Barcode Adapters. The ligated DNA was purified using Agencourt 
AMPure XP Reagent. Quantification and qualification for the amplified libraries 
were quantified and their quality assessed with Agilent 2200 TapeStation System 
and the Agilent High Sensitivity D1000 Screen Tape (Agilent Technologies) and 
with the Ion Library TaqMan Quantitation Kit (Ion Torrent). All barcoded libraries 
were pooled in equimolar amounts (50 pM) to ensure equal representation of each 
barcoded library in the sequencing run. Template preparation and Ion PI Chip 
loading were performed on the Ion Chef Instrument. NGS was performed on Ion 
Proton platform (Ion Torrent) with Ion PI Chip. 

All raw reads were aligned to the C57BL/6J mouse mtDNA reference sequence 
(AY172335.1) using Torrent Mapping Alignment Program (TMAP) version 5.0.7. 
Variants were called using the Torrent Variant Caller (TVC) version 5.0.7, apply- 
ing a minimum allele frequency threshold of 0.001 and reducing the coverage to 
10,000 in over-sampled locations (keeping default values for somatic mutations 
for the rest of parameters) and then filed into variant call format (VCF) reports. 
Additionally, the relative frequency of the alternative allele was calculated for each 
individual and position. Cases deviating more than five standard deviations from 
the average frequency in that position for the 28 individuals were compared with 
variants obtained from TVC. In this way false positives (positions identified as 
variants in TVC but not clearly deviating from the mean value) were identified 
and discarded from the final variant set. 

At the same time, variants found in the complete set of NZB individuals but 
lacking in all C57 individuals were identified as sequence differences between NZB 
mtDNA and C57 mtDNA. Extended Data Fig. 1b was generated with Geneious 
version 9.1.2, created by Biomatters. 

Mouse experimentation. All animal procedures conformed to EU Directive 
86/609/EEC and Recommendation 2007/526/EC regarding the protection of ani- 
mals used for experimental and other scientific purposes, enforced in Spanish 
law under Real Decreto 1201/2005. Approval the different experimental protocols 
require the estimation of the adequate sample size as well as the definition of the 
randomization and blinding criteria. Mice under endpoint criteria were excluded 
from analysis in long-term experiments. The mice were fed a standard chow diet 
(5K67 LabDiet) or, when indicated, a high-fat diet (D12492 Open Source Diets). 
Unless specified, experiments were carried out with males. 

Growth curves. To calculate growth curves, mice were weighed at the starting age 
and every week/month thereafter for the period indicated in each experiment. 


Three independent groups of animals were used for Fig. 1b and Extended Data 
Figs 2a and 6b. 

Survival study. Mice found dead at each daily inspection were considered as cen- 
sored deaths, and were necropsied to tumour inspection. Criteria for euthanasia 
were based on an independent assessment made by a veterinarian, and only cases 
where the condition of the animal was considered incompatible with continued 
survival are represented as deaths in the curves. Survival curves, weight gain and 
tumour incidence analysis (Fig. 1a, b, f) were done in the same group of animals. 
Behaviour and metabolism analysis. To analyse food and water intake, an inde- 
pendent group of 10-12 males per strain were individually housed, and weekly 
intake was calculated as the difference between the supplied weight of food and 
drink and the weight remaining after one week. Intake was calculated in males 
because females were grouped and had to compete for food and drink. Mouse 
activity was evaluated in 10-week-old mice using the PHYSIOCAGE system 
(Panlab). Animals were acclimatized to the room and cages in the week before the 
experiment, and measurements were made on four consecutive days (five days 
and four nights). Data were analysed with Metabolism v2.2 (Panlab). The first 
diurnal phase (12h) was considered part of the acclimatization period and was 
eliminated from the final result. Indirect calorimetry measurements were made 
of individual 10-week-old mice (10 per sex and strain) housed in Oxylet System 
metabolic chambers (Panlab Harvard Apparatus). Before experiments, mice were 
acclimatized in the cages for 1 week. Throughout the 24h measurement period, 
oxygen consumption and carbon dioxide production rates were measured at 
24-min intervals, with a 4min sample period and a 1 min purge; simultaneous 
measurements were made in four metabolic chambers. Respiratory exchange rate 
(RER, or respiratory quotient, RQ), a measure of metabolic substrate choice, was 
calculated as the ratio between CO, production and O2 consumption. Energy 
expenditure (EE) was calculated as (3.815 + (1.232 x RQ)) x VO» x 1.44). Data 
were analysed with Metabolism v2.2 (Panlab). Investigators were not blinded to the 
group allocation during the experiment and the sub-groups of animals included 
in repeated studies was randomly organized. 

Glucose metabolism and blood biochemistry analysis. Experiments were per- 
formed in an independent group of 12 mice per genotype. Mice fed the chow diet 
were fasted for 6 or 16h with free access to water. For intraperitoneal (IP) glucose 
tolerance tests (GTT), the mice received glucose injections of 1 or 2mg per kg. 
Mice were bled from a tail clip. Blood glucose was measured with a handheld 
glucometer before injection (time 0) and at 15, 30, 60, 90 and 120 min after injec- 
tion. Insulin was measured by an enzyme-linked immunosorbent assay (ELISA) 
(Millipore), 3 technical replicates per sample. The homeostasis model for insulin 
resistance (HOMA-IR) was calculated from the fasting blood glucose (mmol per 
litre) x fasting plasma insulin (\1U ml) divided by 22.5. 

Levels of serum cholesterol (ChoT), low-density lipoprotein (LDL), and 
high-density lipoprotein (HDL) were determined by biochemical analysis of sub- 
mandibular blood samples. Serum biochemistry was conducted in a Dimension 
RxL Max automated analyser. Investigators were not blinded to the group alloca- 
tion when assessing the outcome. 

Histological analysis. Tissue samples were fixed in 4% paraformaldehyde (24h), 
processed and embedded in paraffin. Sections (5,1m) were prepared and mounted 
on coverslips for staining with haematoxylin and eosin and tissue images were 
captured using a NanoZoomer 2.0RS digital slide scanner (Hamamatsu). Sections 
of epididymal fat pad stained for white adipose tissue were examined under a 
Nikon 90i microscope fitted with a 10x objective. Adipocyte cross-sectional area 
was determined from epididymal fat pads using Image] software (4 sections per 
animal). Slides from liver necropsies were digitalized and analysed with NDP.view2 
viewing (Hamamatsu). Lipofuscin area was determined from digitalized haema- 
toxylin and eosin histological ovary sections (3 sections per ovary) using NDP. 
view2. Measurements were obtained by applying colour thresholds with Image] 
1.48v (NIH). Investigators were blinded to the group allocation when assessing 
the outcome. 

In vivo ROS determination. In vivo mitochondrial hydrogen peroxide was meas- 
ured using the MitoB mass spectrometric probe as described previously'*”’. Briefly, 
25nmol MitoB was injected into the tail vein between 8 a.m. and 10 a.m. After 
6h, mice were euthanized by cervical dislocation and tissues were harvested and 
flash frozen in liquid nitrogen. Tissues were stored at —80°C and transported to 
Cambridge, UK on dry ice for analysis. When indicated, mice were treated with 
1% NAC in drinking water for 8 weeks before MitoB injection. Investigators were 
blinded to the group allocation when assessing the outcome. 

OXPHOS function. Mitochondria were isolated from mouse liver samples as 
described, and oxygen consumption and ATP production were measured». For 
mtDNA copy number quantification, total DNA was isolated from tissue homoge- 
nate preparation (liver and heart) and was quantified by real-time PCR using 
7900HT Fast Real-Time PCR System (Applied Biosystem) as described'!. When 
indicated, mice were treated with 1% NAC in drinking water for 8 weeks before 
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sample collection. Each data represent mean of 4 technical replicates. Investigators 
were not blinded to the group allocation when assessing the outcome. 

Blue native electrophoresis (BNE). Supercomplex levels and composition were 
analysed in isolated mitochondria from liver and heart by BNE as described”* 
using the following antibodies: corel, ndufa9 (Abcam); FpSDH, COXI (Novex) 
and Tom20 (Santa Cruz Biotech). Investigators were not blinded to the group 
allocation when assessing the outcome. 

Telomere length analysis. Telomere length was measured in tail skin samples 
by quantitative fluorescence in situ hybridization (Q-FISH). Paraffin-embedded 
tissue sections (7 |1m) were hybridized with a Cy3-labelled PNA probe containing 
the telomere repeat sequence”. DAPI and Cy3 signals were acquired in separate 
channels using a confocal microscope (Leica SP5) fitted with a 60x objective. All 
slides were stained at the same time, and images were acquired consecutively, with 
lasers kept at the same intensity to avoid day-to-day variations. Maximum intensity 
projections from 12-bit image stacks (15 sections at 1.0,1m steps) were generated 
for image quantification. 

The DAPI images were signal-intensity thresholded and segmented using 
ImageJ. Quantitative image analysis was performed using the Metamorph platform 
(Molecular Devices)”*. After conversion to a 1-bit binary mask, the DAPI image 
was used to define the nuclear area. The binary DAPI mask was applied to the Cy3 
image to obtain a combined image with the telomere fluorescence information for 
each nucleus. Cy3 fluorescence intensity was measured in arbitrary fluorescence 
units (au) as the mean grey value in each nucleus. Only cells from the bulge of the 
hair follicle were analysed. Telomere intensity values were exported to Excel. The 
analysis was conducted on one-year-old BL/6“” (n= 6), one-year-old BL/6N”® 
(n=6), two-year-old BL/6“”(n = 4), and two-year-old BL/6N”8 (n =5) mice, and 
two images were acquired per animal. The number of cells analysed per group is 
indicated in the frequency histograms (Fig. 1i). 

The statistical significance of differences in telomere length distributions was 
calculated with a Wilcoxon's rank sum test. A two-tailed t-test was used to com- 
pare the reduction in mean telomere length and the percentages of short and long 
telomeres. Calculations were made in Microsoft Excel and Prism. Differences were 
considered significant at *P < 0.05, at **P< 0.01, and at ***P< 0.001. Investigators 
were not blinded to the group allocation when assessing the outcome. 
Phenotype scoring. Mice aged ~2 years old were scored for a variety of ageing 
markers (reduced hair density, greying hair, kyphosis, body weight reduction and 
development of systemic neoplasias) as previously described". Investigators were 
blinded to the group allocation during the experiment. 

Transcriptomic analysis. Twelve 10-week-old males per genotype were killed by 
cervical decapitation, and liver (median lobe) and heart (left ventricle) tissues were 
dissected and immediately frozen in liquid nitrogen. Total RNA was extracted with 
TRIzol reagent and then purified on RNeasy spin columns (Qiagen). RNA integrity 
(RNA Integrity Score > 7.9) and quantity were determined with an Agilent 2100 
Bioanalyzer. Equal RNA amounts from individual animals were combined at 4 
samples per pool, and 3 pools per genotype were analysed in the Illumina Genome 
Analyser IIx. Reads were pre-processed with Cutadapt 1.2.1 to remove the TruSeq 
adaptor. To focus on nuclear genes, we identified and removed mitochondrial reads 
by aligning all reads to mitochondrial DNA using Tophat. Resulting reads were 
mapped on the mouse transcriptome (Ensembl gene-build GRCm38.v70) and 
genome, using RSEM v1.2.3 and Bowtie2 v2.0.6. Genes with at least one count 
per million in at least three samples were considered for downstream analysis. 
As data were generated in two different sequencing runs, we used the R function 
Combat (http://www.bu.edu/jlab/wp-assets/ComBat/Download.html) to adjust for 
potential batch effects. Statistical analysis was performed on the remaining set of 
genes using the bioconductor package EdgeR”®. Raw reads and TMM-normalized 
batch corrected counts have been deposited in GEO with the accession number 
GSE56933. Sets of genes differentially expressed across conditions were analysed 
for functional associations using IPA (Ingenuity Knowledge Database, http://www. 
ingenuity.com). Benjamini-Hochberg adjusted P values are reported for differen- 
tial gene expression and IPA functional analysis. Gene Set Enrichment Analysis 
(GSEA”) identified sets of co-expressed genes that are functionally linked based on 
different ontologies and/or pathways. Gene sets with a FDR <0.25 will be consid- 
ered to functionally related. Investigators were not blinded to the group allocation 
when assessing the outcome. 

Metabolomic analysis. Samples of liver and plasma from 10-week-old mice (n=6 
per genotype) were collected, frozen in liquid nitrogen and sent to Metabolon 
(Durham, NC, USA) for metabolomics analysis according to their proprietary 
methodology. The analysis profiled 313 metabolites in liver samples and 287 in 
plasma, with 197 common to both sources. 

For the multivariate model analysis, between-group differences in the abun- 
dance of metabolites identified by mass spectrometry (LC-MS and GC-MS) were 
assessed by Welch's two-sample t-test. Differences were considered statistically 
significant at P< 0.05. Chemometric statistical analysis was performed using 
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R scripts (http://cran.r-project.org/) and the mixOmics package (v. 5.0.1) executed 
in RStudio (v. 0.98.501) for MacosX. Most metabolomic analysis employs unsu- 
pervised principal components analysis (PCA) to explore the quality of data and 
detect possible outliers and a supervised partial least squares-discriminant anal- 
ysis (PLS-DA) to determine the number of metabolites responsible. PLS-DA is a 
supervised extension of PCA used to distinguish two or more classes by searching 
for original metabolic variables (X matrix) that are correlated to class member- 
ship (that is, liver, plasma, and mouse model) (Y matrix). Results of the PCA and 
PLS-DA were visualized by a scatter score plot of different PCs (PCA) and the 
first two latent variables (PLS-DA). The original X matrix contains the metabolic 
variables, while the Y matrix contains the class variable for which values are chosen 
to be the class descriptor. We performed the analysis by comparing BL/6°” samples 
with BL/6N73 samples, and liver samples with plasma samples. False discovery 
rate and overfitting were avoided by using a leave-one-out cross-validation, per- 
formed with the calibration data set, to determine the number of latent variables 
for each fold and to construct a model. The resulting model was applied to the test 
data set, including the 287 plasma metabolites and the 313 liver metabolites, to 
compute the predicted Y values. Due to the small number of samples, this process 
was repeated the number of times needed in each analysis so that every sample 
served as an independent test sample once and only once. The predicted Y values 
produced by the cross-model validation were used to examine the classification 
error and determine the number of most discriminant metabolites. Based on this, 
we selected only those metabolites that appear in at least 50% of the repeated 
analysis. For metabolomic set enrichment analysis (MSEA)”’, raw data from the 
metabolic analysis were normalized by quantiles using the Bioconductor LIMMA 
package. Metabolites with missing values were excluded. A moderated t-test, also 
implemented in the LIMMA package, was used to calculate log fold-differences 
between male BL/6N”8 and BL/6“*” mice for liver and plasma samples separately, 
together with raw and Benjamini-Hochberg adjusted P values. MSEA was then run 
on the normalized samples and the signal-to-noise ratio statistic was used to rank 
metabolites in each pathway of interest. A false discovery rate (FDR) <0.25 was 
considered significant. We considered sets of at least 10 metabolites in all analyses, 
except for bile acid, which required at least 5 metabolites to be in the same class for 
a result to be considered significant. 

Proteomic analysis. Heart and liver protein extracts from 12- or 80-week-old 
BL/6°” and BL/6N”8 mice (n =3 per genotype and age) were reducted, alky- 
lated with iodoacetamide and digested with trypsin and the resulting peptides 
were labelled with iTRAQ8-plex, according to the manufacturer's instructions. 
Labelled peptides were injected onto a C-18 reversed phase (RP) nano-column 
(75\1m LD. and 50cm, Acclaim PepMap, Thermo Fisher, San José, CA, USA) 
and analysed in a continuous acetonitrile gradient consisting of 8-31% B for 
240 min, and 50-90% B for 1 min (B =0.5% formic acid in acetonitrile). Peptides 
were eluted from the RP nano-column at a flow rate of ~200 nl min“! to an 
emitter nanospray needle for real-time ionization and peptide fragmentation in a 
Q-Exactive HF mass spectrometer (Thermo Fisher). Mass spectra were acquired 
in a data-dependent manner, with an automatic switch between MS and MS/MS 
using a top 20 method. 

The raw files were analysed with Proteome Discoverer (version 1.4, Thermo 
Fisher Scientific), using a Uniprot database containing a joint Mouse + Rat 
Swissprot database (Uniprot release 57.3 May 2009; 26.885 entries). For database 
searching, parameters were selected as follows: trypsin digestion with 2 maximum 
missed cleavage sites, precursor mass tolerance of 800 p.p.m., fragment mass toler- 
ance of 30 mmu, variable methionine oxidation and fixed cysteine carbamidometh- 
ylation, lysine and N-terminal modification of + 304.2054 (iTRAQ label). The 
same collections of MS/MS spectra were also searched against inverted databases 
constructed from the same target databases. Peptide identification from MS/MS 
data was performed using the probability ratio method”*. FDR of peptide iden- 
tifications were calculated using the refined method”; 1% FDR was used as the 
criterion for peptide identification. Each peptide was assigned only to the best 
protein proposed by the Proteome Discoverer algorithm. Quantitative informa- 
tion was extracted from iTRAQ sample MS/MS spectra using an in-house devel- 
oped program (QuiXoT) and protein abundance changes were analysed using the 
Generic Integration Algorithm (GIA), as described in ref. 30. The GIA was also 
used to integrate the quantitative protein information from several experiments 
into a unique protein value. Calculation of statistical weights of each quantitation 
at the spectrum level was performed according to the WSPP model. The validity of 
the null hypothesis at each one of the levels (spectrum, peptide, protein within an 
experiment and protein) was carefully checked by plotting the cumulative distri- 
butions. Three mice for each genetic background and age were used as biological 
replicates for heart experiments, and two mice were used for liver experiments. 
Changes in relative protein abundance are expressed using the standardized log, 
ratio (Zq) variable*°. Investigators were not blinded to the group allocation when 
assessing the outcome. 
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'H magnetic resonance spectroscopy (MRS). Male BL/6N7® and BL/6°" mice 
(80 weeks old (n= 4-5), 20 weeks old (n= 6-8)) weighing 30-50g each were 
housed in groups of 4 with unrestricted access to food and water. Spectroscopy 
examinations were performed with a 7T preclinical system (Agilent Varian, 
Palo Alto, USA) equipped with a DD2 console and an active shielded 205/120 
gradient insert coil with a 130 mT per m maximum strength. A double-tuned 
circular transmit/receive coil was used for phosphorus/proton (20 mm), located 
mainly on the soleus muscle, heart and liver (Rapid Biomedical GmBH, Rimpar 
Germany). Investigators were blinded to the group allocation during the 
experiment. 

Proton spectra were acquired by 128 transients with 2,048 complex points with a 
spectral bandwidth of 10 kHz and a repetition time of 1.2 ms. Spectra were acquired 
with adiabatic radiofrequency pulses to improve sensitivity and minimize spec- 
tral distortions with an Ernst flip angle. Proton chemical shifts were expressed 
relative to the water signal (4.7 p.p.m.). Signals in nuclear magnetic resonance 
spectra were determined quantitatively on a Macintosh computer by integration 
after automatic or manual baseline correction, with fitting spectrum peak (after 
phase and baseline correction) to a Lorentzian function using the Mestrenova 
program (Mestrelab Research, Santiago de Compostela, Spain; released 2015-02- 
04 version:10.0.1-14719). An exponential line broadening (3 Hz for proton) was 
applied before Fourier transformation. 

Transmission electron microscopy (TEM). Mice were euthanized by cervi- 
cal dislocation and the left ventricle apex was immediately dissected (12 weeks 
old, n=3 per genotype and 80 weeks old, n= 2 per genotype). Heart samples 
were fixed in 2.5% glutaraldehyde, 4% formaldehyde in 0.1 M HEPES buffer 
for 4-5h. After buffer washes, samples were post-fixed for 1h at room tem- 
perature in a 1:1 solution of 1% osmium tetroxide and 3% aqueous potas- 
sium ferrocyanide. Samples were rinsed in distilled H,O. Tissues were 
dehydrated through a graded acetone series and embedded in Spurr’s low 
viscosity embedding mixture (Electron Microscopy Sciences). Ultra-thin 
sections (60 nm) were then mounted on copper grids and stained with lead citrate. 


Samples were examined on a JEOL 10-10 electron microscope through 1,500 x, 
5,000 x, 40,000 x and 80,000 objectives. Mitochondrial morphometry, cristae 
area (2 images, 5,000, per animal) and lipid droplets (50 images, 5,000, per 
animal) were segmented manually and analysed using Fiji (http://fiji.sc/Fiji) and 
ImageJ 1.48v software. The investigator was blinded to the group allocation when 
assessing the outcome. 

Statistical analysis. Unless specified, statistical analyses and graphics were pro- 
duced with GraphPad Prism 6 software. Data sets were compared by unpaired 
two-tailed t-tests. Differences were considered statistically significant at P values 
below 0.05. *P< 0.05; **P< 0.005; ***P< 0.0005. All results are presented as 
mean +s.d. or mean+s.e.m. 
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Extended Data Figure 1 | Conplastic mouse generation and nuclear 
genome identity. a, Schematic of the conplastic animal concept. 

b, Sequence differences between NZB and C57 mtDNAs by next- 
generation sequencing. See also Extended Data Table 1. c, Cluster 
dendrogram based on the expressed genome of the indicated mouse strain 
and the RNA-seq-derived expressed genome of conplastic animals. 
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d, Comparison of nuclear single nucleotide polymorphism (SNP) 
differences between the indicated mouse lines by Affimetrix Mouse 
Diversity Genotyping Array analysis. e, Percentage of genotype differences 
between the mouse nuclear DNA SNPs identified by RNA-seq; the number 
of SNPs compared is indicated. 
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Extended Data Figure 2 | Young conplastic mouse phenotyping. 

a, Weight-gain during early life (BL/6‘”8, BL/6©”, and NZBN”® separated 
by sex, n = 10 per group). b, Organ-weight to body-weight ratios separated 
by sex (1 = 10 per group, ANOVA test). c, Activity profiles (BL/6N2°, 


BL/6°”, and NZBN“® separated by sex, 10-week-old mice, n= 10 per 
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group). d, Area under the curve analysis of the activities shown in 

c, providing estimated activity intensities of the different strains. e, Ratios 
of activity to respiratory parameters in the different mouse strains. Data 
are means + s.d., *P < 0.05, two-tailed t-test. 
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Extended Data Figure 3 | Ageing phenotype of conplastic mice. 

a, Two-year-old mice were scored for conspicuous signs of the indicated 
ageing parameters (0 (none) to 3 (high))'°. b, Representative image of 
2-year-old animals. c, Tumours incidence detected upon liver necropsy 
of animals dying from natural causes (BL/6N78, n = 14; BL/6”, n = 20 
mice). d, Representative haematoxylin and eosin stained paraffin sections 
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of liver adenoma, carcinoma, histiocytic sarcoma and hemangiosarcoma. 
e, Representative histological preparations of ovaries from animals of 

the indicated age and genotype. f, Lipofuscin accumulation over the 
reproductive period of conplastic females (n = 5 females per genotype and 
age) *P < 0.05, two-tailed t-test. 
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Extended Data Figure 4 | Additional transcriptomic data. a, Chord 

plot comparing liver expression of PPARa-regulated genes in conplastic 
mice. The chord plot shows gene changes (logFC) and relates them to their 
functional categories. b, Gene set enrichment analysis (GSEA), showing 
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Extended Data Figure 5 | Differentially regulated metabolites 

and metabolic pathways in liver and plasma. a, PLS-DA-identified 
metabolites responsible for differential metabolic signatures (P, plasma; 
L, liver). The metabolite list was cross-validated, and only metabolites 
present in 50% or more iterations were considered. Colour highlighting 
indicates metabolites as potential markers of the indicated metabolic 
route. b, Glutathione metabolic pathway and heat map showing MSEA 
assessment of all detected metabolites implicated in GSH metabolism, 
regardless of whether they were identified as relevant in PLS-DA (red text 
indicates PLS-DA relevant), and of selected oxidative stress metabolic 
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biomarkers. These data are also shown in Fig. 2d. c, Heat map of detected 
metabolites implicated in bile acid and xenobiotic metabolism. d, Heat 
map of detected metabolites implicated in phospholipid metabolism. 
Heat maps are accompanied by MSEA assessment of significant (red) 

or nonsignificant differences (FDR). Colour-coding in all heat maps 
represents the log fold difference between BL/6N”® and BL/6°” mice 
(black, not detected; white, no change (log fc = 1); red, trend for 
enrichment in BL/6N8 samples (log fc >1); yellow, trend for depletion in 
BL/6NZ8 samples (log fc <1). 
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Extended Data Figure 6 | Quantitative proteomics analysis of 
mitochondrial complexes and functional categories. a, Blue native 
electrophoresis of liver samples from 8-, 12-, 16- and 50-week-old 
conplastic mice. Top: complex I, (Ndufa9); middle: complex HI (core-2); 
bottom: complex IV (Cox5b) and Tom20. b, Ratio of supercomplexed 

to free complex I, determined from the CI western blot in a. c, Relative 
abundance of component proteins of CI, CIV and CII] detected by iTRAQ 
quantitative proteomics in heart and liver of the indicated mouse strain at 
12 weeks and 2 years of age. Each dot represents a single protein; red dots 
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represent mtDNA-encoded proteins. d, Relative abundance of proteins of 
the TCA cycle, glycolysis, or 3-oxidation detected by iTRAQ quantitative 
proteomics in heart and liver of the indicated mouse strain at 12 weeks 
old and 2 years of age. The four succinate dehydrogenase (complex II) 
subunits are highlighted in green. The iTRAQ-detected proteins and Zq 
values represented in c and d are compiled in Supplementary Data 4. 
Data are means + s.d., ANOVA test. *P < 0.05; **P < 0.01; ***P< 0.001; 
#EEED < (2.00001. 
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Extended Data Figure 7 | Lipid metabolism phenotype. a, Serum 


in males fed chow (n= 4 per genotype) or a HED (n=8 per genotype). 


cholesterol levels (ChoT), low-density lipoprotein (LDL), and high-density 
lipoprotein (HDL) in young (2 months old, n = 8 per genotype) and adult 
(12 months old, n= 12 per genotype). b, Three-month body-weight gain 


c, Adipocyte size in epididymal fat pads of HFD-fed conplastic males 
(n= =3 per genotype and diet: BL/6“” chow diet = 368 cells, 
HED = 60 cells; BL/6“78 chow diet = 353 cells, HED = 209 cells). 
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Extended Data Figure 8 | Mitochondrial cristae and density. percentage of cristae area in an individual mitochondria; (BL/6N®, n = 33; 
a, Representative masks of individual heart mitochondria from 12-week-old — BL/6°°’, 1 =31) c, d, Aspect and electro-density of subsarcolemmal (SBS) 
mice (n =3 per genotype). b, The cristae area per mitochondria estimated _ or intermyofibrillar (IMF) mitochondria from left heart ventricular apex 
from masks. Significance was assessed by nonparametric Mann-Whitney of 12-week-old mice. 

U-test. Bars and whiskers represent means + s.d.; each point represents the 
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Extended Data Table 1 | Sequence differences between NZB and C57 mtDNA 
Position C57BL/6 NZB Gen Category Position C57BL/6 NZB Gen Category 


55 G A tRNA Phe non-coding 8568 Cc li ATP6 silent 

716 A G rRNA 16S non-coding 8858 Tl Cc COIll silent 
1353 A G tRNA 16S non-coding 8864 Cc 1, COolll silent 
1519 G A rRNA 16S non-coding 9137 A G COlll silent 
1590 G A rRNA 16S non-coding 9152 Wl Cc COlll silent 
1822 Ai Cc tRNA 16S non-coding 9391 A G tRNA Gly non-coding 
2201 il Cc tRNA 16S non-coding 9461 TT Cc ND3 silent 
2340 G A tRNA 16S non-coding 9530 Cc 1 ND3 silent 
2525 Cc il tRNA 16S non-coding 9581 Cc V ND3 silent 
2766 A G ND1 Missense 9599 A G ND3 silent 
2767 ‘Ali Cc ND1 missense 9829* wW AA tRNA Arg non-coding 
2798 Cc T ND1 silent 10547 Cc ili ND4 silent 
2814 T Cc ND1 silent 10583 A G ND4 silent 
2840 Cc a ND1 silent 10952 Cc A ND4 missense 
2934 Cc A ND1 missense 11843 G A NDS silent 
3194 aly Cc ND1 silent 11846 Cc 1 NDS silent 
3260 A G ND1 silent 11933 A Cc NDS silent 
3422 T Cc ND1 silent 12353 Cc T NDS silent 
3467 T Cc ND1 silent 12575 Wh A NDS silent 
3599 T Cc ND1 silent 12695 A G NDS silent 
3692 A G ND1 silent 12835 4] Cc NDS missense 
3932 G A ND2 missense 12890 A G NDS silent 
4123 Cc Tj ND2 silent 13004 G A NDS silent 
4276 G A ND2 silent 13444 Cc a NDS missense 
4324 li Cc ND2 silent 13612 ll Cc ND6 silent 
4408 G A ND2 silent 13689 Cc a ND6 missense 
4706 A G ND2 missense 13781 A G ND6 missense 
4732 Cc T ND2 silent 13780 1 Cc ND6 missense 
4771 T Cc ND2 silent 13837 A G ND6 silent 
4885 A Cc ND2 silent 13983 A G ND6 silent 
4903 T G ND2 silent 14186 ili Cc CYTB silent 
5204 A AG tRNA Cys non-coding 14211 G A CYTB missense 
5463 G A col missense 14363 A G CYTB silent 
5552 T Cc col silent 14642 G A CYTB silent 
5930 G A col silent 14738 Cc iT CYTB silent 
6041 T Cc col silent 15499 AD A D-Loop intergenic 
6407 Cc ag col silent 15549 Cc TE D-Loop intergenic 
6470 A G col silent 15578 A Tt D-Loop intergenic 
6575 Cc ily col silent 15588 Cc T D-Loop intergenic 
6620 G A col silent 15603 Cc Ap D-Loop intergenic 
6785 G A col silent 15657 1 Cc D-Loop intergenic 
7411 A G Coll silent 15916 Cc T D-Loop intergenic 
7870 G A ATP8 silent 16017 A Cc D-Loop intergenic 
8439 A G ATP6 silent 16268 A G D-Loop intergenic 
8467 ap Cc ATP6 silent 16272 Mt ie D-Loop intergeni 


Differences were identified by mtDNA-NGS. The *9829insAA position was confirmed by Sanger sequencing. 
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Extended Data Table 2 | Analysis of the mtDNA mutation load 


Average 
Mouse type Animal Mutations eee Bacecavedee 
depth 
BL/6°*”_12w 1 8769 99.52% 
BLI6°*”_12w 2 8910 99.19% 
BLI6®*”_12w 3 1 6289 99.08% 
BLI6°*”_12w 4 1 9766 99.45% 
BLI6®*”_12w 5 0) 8923 99.28% 
Total 2 
BLI6**”_40w 1 0) 8806 98.96% 
BL/6°°”_40w 2 0) 8884 99.38% 
BLI6**”_40w 3 0) 8466 99.30% 
BL/6**”_40w 4 2 9451 99.39% 
Total 2 
BL/6™*”_80w 1 0) 8728 99.44% 
BL/6°*”_80w 2 1 8857 99.29% 
BL/6**”_80w 3 0) 9710 99.49% 
BL/6**”_80w 4 1 11139 99.35% 
BL/6®*”_80w 5 0) 9833 99.46% 
Total 2 
BL/6’7®_12w 1 1 10992 99.54% 
BL/6’”?_12w 2 () 9237 99.45% 
BLi6’”"_12w 3 0) 8417 99.49% 
BL/6’7®_12w 4 0 8341 99.44% 
BLi6”?_12w 5 0 9672 99.64% 
Total 1 
BL/62®_40w 1 ) 8898 99.52% 
BL/6”?_40w 2 1 8680 99.49% 
BLi6’”?_40w 3 0) 8090 99.60% 
BL/6‘”?_40w 4 () 8109 99.43% 
Total 1 
BL/6’7®_80w 1 () 9495 99.39% 
BL/6’7®_80w 2 () 8288 99.33% 
BL/6’7®_80w 3 ) 10994 99.53% 
BL/6”?_80w 4 0 8799 99.58% 
BL/6’7®_80w 5 ) 9759 99.39% 
Total 0 


Mouse type Animal Position R/A AA FDP %Het Gen Category 


BLI6’”’ 12w = 3 8258 G/T GN 9962 2,3 ATP6 Missense 

BLI6*’ 12w 4 5636 A/T WIC 9946 4,7 COXI Missense 
BLI6®”’ 40w 4 1025 C/T none 9977 7,0 tRNA‘ Non- 
coding 

BLI6*’ 40w 4 8637 G/A Vil 9943 28,5 COXIIl Missense 
BLI6*’ 80w 2 80 T/G none 8660 2,4 12S Non- 
rRNA coding 
BLI6’*’ 80w 4 18 AIG none 6205 47,5 tRNA‘ Non- 
coding 


BL/6‘7?_12w 7633. G/A none 9979 3,4 COXII Silent 
BLI6’” 40w 2 10624 C/A none 7931 5,8 ND4 Silent 


= 
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Sliding sleeves of XRCC4-XLF bridge DNA and 
connect fragments of broken DNA 


Ineke Brouwer", Gerrit Sitters'*+, Andrea Candelli'+, Stephanie J. Heerema!'+, Iddo Heller!, Abinadabe J. de Melo’, 
Hongshan Zhang’, Davide Normanno?, Mauro Modesti’s, Erwin J. G. Peterman! & Gijs J. L. Wuite!s 


Non-homologous end joining (NHEJ) is the primary pathway 
for repairing DNA double-strand breaks (DSBs) in mammalian 
cells!. Such breaks are formed, for example, during gene-segment 
rearrangements in the adaptive immune system or by cancer 
therapeutic agents. Although the core components of the NHEJ 
machinery are known, it has remained difficult to assess the 
specific roles of these components and the dynamics of bringing 
and holding the fragments of broken DNA together. The structurally 
similar XRCC4 and XLF proteins are proposed to assemble as highly 
dynamic filaments at (or near) DSBs”. Here we show, using dual- 
and quadruple-trap optical tweezers combined with fluorescence 
microscopy, how human XRCC4, XLF and XRCC4-XLF complexes 
interact with DNA in real time. We find that XLF stimulates the 
binding of XRCC4 to DNA, forming heteromeric complexes that 
diffuse swiftly along the DNA. Moreover, we find that XRCC4-XLF 
complexes robustly bridge two independent DNA molecules and that 
these bridges are able to slide along the DNA. These observations 
suggest that XRCC4-XLF complexes form mobile sleeve-like 
structures around DNA that can reconnect the broken ends very 
rapidly and hold them together. Understanding the dynamics and 
regulation of this mechanism will lead to clarification of how NHEJ 
proteins are involved in generating chromosomal translocations** 
It has long been assumed that the NHEJ components assemble 
sequentially, starting with the Ku heterodimer (Ku70 and Ku80 
subunits) binding to DNA ends and protecting them, followed by the 
DNA-dependent protein kinase catalytic subunit (DNA-PKcs), which 
promotes end synapsis. According to this model, end-processing 
factors are recruited next, and DNA ligase 4 (LIG4) in complex with 
XRCC4 stimulated by XLF ligates the broken ends. Recent studies have, 
however, challenged this sequence of events”. In particular, XRCC4 is 
present in large excess over LIG4 (ref. 6) and associates with XLF to 
form complexes promoting LIG4-independent broken end bridging’~? 
Crystal structures and biochemical analysis of XRCC4-XLF complexes 
revealed that they form alternating helical filaments capable of bridging 
DNA!13. The biological relevance of these filaments is the subject of 
current studies. Recent super-resolution microscopy of human U2O0S 
cells showed that XRCC4, XLF and LIG4 assemble as filamentous 
structures adjacent to bleomycin-induced DSBs”. XRCC4 and XLF 
can form similar structures, even in cells lacking LIG4, indicating that 
XRCC4-XLF filaments have a central role in NHEJ. To gain insight 
into the dynamics of these filaments and their interactions with DNA, 
we used a single-molecule approach combining dual optical trapping, 
microfluidics and fluorescence microscopy!*!®. To study DNA 
bridging, we needed to extend this approach by adding two additional 
traps to our instrument, thus allowing the independent manipulation 
of two double-stranded DNA (dsDNA) molecules and the direct 
visualization of interactions between them. This instrument allowed 


us to investigate the bringing and holding together of DNA fragments 
by human XRCC4 and XLE key intermediate steps in NHE). 

First, we studied the interaction of fluorescently labelled XLF 
and XRCC4 (Extended Data Figs 1-3) on DNA held taut by two 
optically trapped microspheres (Fig. 1). Such experiments show that 
both proteins bind to end-occluded dsDNA but not to stretched 
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Figure 1 | Interaction of XRCC4 and XLF with DNA. a, b, d-f, 
Kymographs of dsDNA with 10 nM XLF (3 ms pixel integration time) 

(a), 100nM XLF (b), 7nM XRCC4 (d), 120nM XRCC4 (e) and 

7nM XRCC4 and 100 nM XLF (only XRRC4 was excited) (f). 

c, g, Concentration-dependent DNA-binding of XLF (c) and XRCC4 (g). 
AU, arbitrary units. Error bars denote s.e.m. h, Kymograph of XRCC4 
and XLF binding. At tensions above ~65 pN the DNA overstretches by 
unpeeling from DNA ends until 18% of the molecule consists of ssDNA. 
No binding of XLF and XRCC4 to ssDNA was observed. i, Kymographs 
of XRCC4 and XLF binding in the presence of proteins (left) and after 
exchange with protein-free buffer (right; same construct and settings, 

6s later). j, Force-extension curve of dsDNA incubated (under tension) in 
200nM XRCC4 and 200 nM XLF (blue) compared to bare dsDNA (black). 
Inset, simultaneous kymograph. Dark regions in the DNA appearing 
above ~60 pN are due to ssDNA generated by force-induced melting. Red, 
enhanced green fluorescent protein (eGFP)-labelled XLF; green, XRCC4- 
Alexa Fluor 555; yellow, colocalization. Scale bars, 44:m and 5s. Pixel 
integration time: 3 ms (a) or 0.1 ms (others). Data are representative of 

13 (a), 17 (b), 15 (d), 11 (e), 10 (f), 6 (h), 12 (i) and 8 (j) experiments. 
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single-stranded DNA (ssDNA) (Fig. 1h and Extended Data Fig. 4a). 
In the absence of XRCC4, XLF binds to DNA transiently at low 
concentrations (Fig. la), and forms stable oligomers at higher 
concentrations (Fig. 1b), a large fraction of which is able to diffuse 
over DNA. The amount of XLF bound to the DNA increased drastically 
with increasing XLF concentrations and saturated around 100 nM 
(Fig. 1c), suggesting cooperative binding. XRCC4 exhibits a different 
behaviour: at all concentrations tested, XRCC4 binds DNA in a less 
cooperative manner, forming stable oligomers that are able to diffuse 
over DNA (Fig. 1d, e, g). Both XLF and XRCC4 oligomers appeared to 
bind directly from solution and, once bound to the DNA, oligomers 
did not appear to grow in size (Extended Data Fig. 4e). 

Next, we addressed the interaction between the two proteins. In the 
presence of XLE the affinity of XRCC4 for the DNA was enhanced 
(Fig. 1f, h) and XRCC4 was distributed more homogenously over 
the DNA (compare Fig. 1d to 1f). By contrast, XRCC4 did not 
significantly alter the affinity of XLF for DNA (Fig. 1c). We noticed 
that when a DNA molecule was moved to a protein-free solution 
after incubation in the presence of XLF and XRCC4, the fluorescence 
signal decreased much faster than the estimated bleaching rate 
(Fig. li), indicating that proteins (presumably dimers) were dissociating 
rapidly (that is, within 10 ms) and thus were bound transiently to the 
DNA (Extended Data Fig. 4d). The proteins that remain are assembled 
in oligomers (Extended Data Fig. 4b, c), most of which diffuse (88 
out of 94). Moreover, these protein complexes dissociate at least four 
orders of magnitude slower (tens of seconds to tens of minutes) than the 
transiently bound dimers (Extended Data Figs 4f and 5g, h). Of these 
oligomers, 29 + 5% (mean +s.e.m.) contained both XRCC4 and XLF, 
9+ 3% only XRCC4 and 62 + 8% only XLF (of a total of 94 detected 
oligomers at a concentration of 50nM each). Together, these results 
suggest that XLF regulates the loading of (heteromeric) complexes 
onto DNA, possibly by influencing the oligomeric state of XRCC4 in 
solution!”'®, 

A large fraction of the stably bound complexes containing XLF, 
XRCC4 or both switched back and forth between static and diffusive 
binding (36 out of 94) (Extended Data Fig. 6a). A smaller fraction 
seemed to be fixed (>120s) on the DNA (4 out of 94), illustrating 
that the complexes can interact with the DNA in different ways. 
The typical switching time between the static and the mobile state is 
approximately 10s (Extended Data Fig. 6f). For the mobile fraction, 
diffusion coefficients were 0.7 £0.1 jim? s~!, 0.7 £0.1 1m’ s~! and 
0.6+0.2 yum? s! for XLF, XRCC4 and XRCC4-XLE complexes, 
respectively, at 25 mM KC] (Extended Data Fig. 6b). These values 
were not strongly dependent on DNA tension and complex size, but 
increased at higher ionic strength (Extended Data Fig. 6c, d). Under 
all conditions tested, the diffusion was faster than would be expected 
for complexes undergoing rotation-coupled diffusion along the DNA” 
(Extended Data Fig. 6e), suggesting sliding without tracking the helical 
pitch. Furthermore, the mechanical properties of DNA were not altered 
upon binding of XRCC4 and XLE even at high density (Fig. 1j). On the 
basis of these observations, we propose that, in mobile complexes, the 
proteins bind around the DNA as a sleeve: once loaded, the stably bound 
complexes slide along the DNA in a diffusive manner experiencing 
little friction. 

Next, we addressed whether these XRCC4-XLF complexes have a 
role in bringing and holding DNA fragments together, crucial steps in 
NHE). To study this in a well-controlled way (Extended Data Fig. 7), we 
developed an instrument combining quadruple-trap optical tweezers 
with wide-field fluorescence microscopy. Using this instrument, we 
independently trapped two end-occluded dsDNA molecules, held them 
taut and wrapped them (Extended Data Fig. 8). We incubated these 
intertwined dsDNA molecules in a channel containing 200 nM XLF 
and 200nM XRCC4 for 2 min. We moved these DNA molecules into 
a protein-free buffer, unwrapped them and visualized whether bridge 
formation had occurred using fluorescence. The mechanics of the 
bridge were probed by moving the microspheres and detecting the 
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Figure 2 | DNA bridging by XRCC4 and XLF. a, Fluorescence images 
(bottom) and corresponding cartoons (top) of two bridged dsDNA 
molecules after wrapping, incubating (2 min in 200nM XRCC4 and 
200nM XLF) and unwrapping. At 4, 24, 62 and 87 s images are overlays 
of green and red images. Arrows denote bridge location. X marks 
microsphere detached from bridge. b, Intensity profiles (top) and 
logarithmically scaled kymographs (bottom) along DNA indicated in 
insets. Dashed line denotes bridge position. c, Distribution of rupture 
forces of bridged complexes (such as shown at 87 s in a). Open circles 
measured in 50 mM potassium acetate, 20 mM Tris-acetate, 10 mM 
magnesium acetate and 100g ml! BSA (pH 7.9), filled circles in 20 mM 
Tris-HCl (pH 7.6), 25 mM KCl and 1mM DTT. Inset, force-extension 
curve of such a complex. Green, XRCC4-Alexa Fluor 555, red, eGFP-XLF, 
yellow, colocalization. Scale bars, 5s and 11m. Data are representative 
examples of 117 experiments. 


resulting forces. An example of such an experiment is shown in 
Fig. 2a (see also Supplementary Video 1 and Extended Data Fig. 9). 
In this experiment, the fluorescence image showed that a bridge 
was present after incubating and unwrapping the DNA molecules 
(Fig. 2a, 4s). At the DNA-DNA interaction site, a yellow patch is clearly 
visible, demonstrating that both XRCC4 and XLF are present at the 
bridge. Subsequently, we moved the bottom-right microsphere to apply 
a high force (>100 pN) until we observed a sudden force drop, due to 
one of the DNA ends detaching from the bottom-right bead (Fig. 2a, 
24s). Thereafter, the top-right microsphere was moved, resulting in the 
detachment of a second DNA end from this microsphere (Fig. 2a, 62s). 
The resulting tether now consisted of two DNA fragments, originating 
from different DNA molecules, held together by XRCC4-XLE. The 
mechanical properties of this protein-DNA complex were indistin- 
guishable from those of bare dsDNA, with a different length from the 
original DNA molecules. This experiment directly demonstrates that 
XRCC4-XLF protein complexes can bridge and hold together broken 
DNA fragments. 

To quantify this bridging behaviour, we repeated the dual- DNA 
experiment 203 times. From these experiments, several key features of 
bridging were extracted. In contrast to the observation on single DNA 
molecules that only ~30% of the complexes contained both XRCC4 
and XLF all 152 bridges between two DNA molecules contained both 
proteins (Extended Data Fig. 10). Bridging was observed in two- 
thirds of the cases, both for wrapped (140 out of 183 tests) and crossed 
(12 out of 20 tests) DNA pairs (Extended Data Fig. 8). Bridges also 
formed between DNA pairs that were pre-incubated with proteins and 
subsequently wrapped/unwrapped in the absence of protein (Extended 
Data Fig. 5a, b). XRCC4-XLF bridges were stable for at least 90 min at 
low force (Extended Data Fig. 5g, h). High force applied to the bridge 
resulted in rupture: in 9 out of 96 cases the protein bridge ruptured 
(Extended Data Fig. 5e, f), and in the remainder of the cases either 
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Figure 3 | Mobility of XRCC4-XLF bridges. a, b, Overlays of eGFP-XLF 
logarithmically scaled fluorescence before (cyan) and after (magenta) 
moving a microsphere (white arrow). DNA molecules were incubated 

in crossed configuration. Circles denote microsphere positions; dashed 
lines denote DNA; coloured arrows denote bridge location. Data are 
representative examples of 13 experiments. c, Distance from bridge 

to bead edges (of complex shown in a and b) as a function of time. Shaded 
regions denote bead motion as indicated by the arrows in the cartoons. 

d, Histogram of XRCC4-XLF bridge mobility (both wrapped and crossed 
configurations). Error bars denote s.e.m. 
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one of the DNA molecules broke or detached from a microsphere 
(Fig. 2a, 24s). To assess the strength of the bridges more directly, we 
determined the rupture forces of bridged DNA fragments tethered 
between two microspheres (for example, the tether in Fig. 2a, 62-875). 
Such complexes ruptured over a wide range of forces (Fig. 2c). Most 
events occurred during DNA overstretching (at ~65 pN), indicating 
breakage of the DNA due to nicks” and therefore not rupture of the 
protein bridges. A substantial fraction broke at higher forces, up to 
250 pN, by detachment of the biotin—-streptavidin DNA-microsphere 
linkers”!, by rupture of the protein bridge, or by the DNA sliding out 
of the sleeve. Together, these experiments indicate that XRCC4-XLF 
complexes readily form highly stable and strong (at least >65 pN) 
DNA-protein bridges. 

To relate our observations of the stable XRCC4-XLF bridges 
between two DNA molecules to the highly mobile sleeves present 
on single DNA molecules, we tested whether the bridges are also 
mobile. In our quadruple-trap assay, we moved one DNA molecule 
with respect to the other and tracked the position of the protein bridge 
in the fluorescence images (for example, Fig. 3a—c, Extended Data 
Fig. 9 and Supplementary Video 2). In 47 out of 60 such experiments, 
we observed that bridges could slide along one (22 cases) or both 
(25 cases) of the DNA molecules (Fig. 3d). This mobility indicates 
that the bridges consist of one or a few sleeves. A single sleeve could 
surround two DNA molecules, allowing them to slide independently 
within the sleeve. Alternatively, two mobile sleeves, each surrounding 
a single DNA molecule, might interact to form a bridge. The static 
binding mode observed on single DNA molecules (Extended Data 
Fig. 6a) might explain why some of the bridges cannot slide along one 
or both DNA molecules. 

The observations so far, using end-occluded DNA, raise the question 
of how XRCC4-XLF complexes interact with DNA fragments and 
DNA ends. Fast bridge formation between the two fragments is vital 
for NHEJ since it avoids interactions that could lead to chromosomal 
translocations. To study the interaction of XRCC4-XLF with DSBs, 
XRCC4-XLF bridges (n = 30) were exposed to the restriction enzyme 
Sfol, inducing a single DSB in both DNA molecules. Figure 4a-d 
illustrates such an experiment (see also Supplementary Video 3). 
This experiment was performed in slight buffer flow, such that, after 
introducing DSBs, the DNA ends were extended in the direction of 
the flow to prevent the DNA from bridging onto itself. XRCC4-XLF 
bridges are most of the time still capable of sliding along the DNA 
fragments (Fig. 4e). At some point, this sliding is stalled, probably when 
the bridge runs into a stationary XLF and/or XRCC4 complex or a DNA 
end. Typically, we do not see the DNA ends sliding out of the bridge 
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Figure 4 | XRCC4-XLF = keep DNA fragments together. 

a-d, Inverse fluorescence images of XRCC4—Alexa Fluor 555 eGFP- 

XLF bridge (see Supplementary Video 3; 532 nm excitation). DSBs are 
introduced using Sfol, cleaving each DNA in one site. Free ends extend in 
buffer flow (black arrows). Blue arrows denote location of one XRCC4— 
XLF complex. e, Relative position of bridge shown in b-d between the 
beads. Inset denotes contour length and force as a function of time. 

f, Force-extension curve of the complex in b-e. g, Fluorescence image of 
proteins at the end of a 32.2-kb flow-stretched blunt-ended SYTOX Red- 
stained DNA molecule. Green, XRCC4—Alexa Fluor 555; red, eGFP—XLF; 
blue, SYTOX Red; white, colocalization. Arrow shows flow direction. 
Inset, zoom with different intensity scaling. h, Proposed mechanism 

for XRCC4 (green) and XLF (red) in bridging and holding together 

DNA fragments (grey) after a DSB (scissors). Scale bars, 2 1m. Data are 
representative examples of 30 (a—f) and 14 (g) experiments. 


when the bridge reaches the end of the DNA fragment (Fig. 4f). These 
experiments show directly that after DSB formation, XRCC4—XLF 
bridges are able to hold together the broken DNA fragments such that 
their ends remain in close proximity to each other, while the mobility 
of the complexes ensures that the ends remain accessible for further 
processing by other repair factors. 

To study more directly what happens when an XRCC4-XLF complex 
encounters a free DNA end, we tethered a blunt-ended dsDNA 
molecule to a single microsphere and extended it using buffer flow. 
We expected that DNA-bound XRCC4-XLF complexes are driven 
towards the DNA end by the buffer flow. Indeed, in fluorescence images 
recorded after moving the construct to a protein-free environment, we 
observed accumulation of XRCC4 and XLF at the DNA end (Fig. 4g) 
in all 14 experiments. This suggests that some of the protein sleeves 
do not seem to slide off the blunt end under the influence of buffer 
flow. However, further experimentation is required to understand how 
sleeves intrinsically interact with ends and how they are affected by 
other factors, such as Ku. 

In this study we have demonstrated that the combination of dual- 
and quadruple-trap optical tweezers with fluorescence microscopy is 
a powerful tool to study the roles of XRCC4 and XLF in NHEJ. Our 
experiments show that binding of XRCC¢4 is stimulated by XLF and 
that together they form stably bound, yet mobile complexes on dsDNA. 
These complexes can form strong, mobile DNA bridges capable of 
holding the DNA fragments in close proximity to one other. Our and 
previous” observations can be explained as follows (Fig. 4h): XRCC4 
and XLF bind to the DNA fragments of a DSB in a highly diffusive 
manner. These complexes can bridge the DNA fragments at any 
location, preventing the fragments from moving apart. The mobility 
of the bridges could allow them to slide along the DNA leaving the ends 
accessible for processing. Our method with the capability to observe 
directly how XRCC4-XLF complexes dynamically interact with DNA 
and each other in NHE] can be used in future experiments to address 
remaining questions in NHE]J, including how the loading of XRCC4- 
XLF complexes is restricted to broken DNA, how they are remodelled 
and regulated, and how Ku, DNA-PKcs, LIG4 and other NHE] factors 
come into play. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized, and investigators were not blinded to allocation during 
experiments and outcome assessment. 

Dual-trap optical tweezers with confocal fluorescence. The optical set-up 
used for dual-trap optical trapping experiments has been described previously'®. 
In brief, two optical traps are generated using a 10 W 1,064nm CW fibre laser 
(YLR-10-LP, IPG Photonics), of which the beam is expanded using a 75mm and 
150mm lens. To align the trapping beam with the confocal DNA imaging, the 
collimation can be modified by translating the 75 mm lens on an automated stage 
(AG-LS25). The trapping beam is split into two independently steerable paths 
using a polarizing beam splitter. In one trap, steering is achieved with an accurate 
piezo mirror (Nano-MTA2X Aluminium, Mad City Labs) and in the other trap 
with a coarse positioning piezo stepper mirror (AG-M100N). After recombining 
the two paths they are coupled into the objective via two 300 mm lenses. Back-focal 
plane interferometry is used for force measurements and the microsphere positions 
are determined using template-matching on images obtained from bright field 
LED illumination on a metal-oxide semiconductor (CMOS) camera (DCC1545M, 
Thorlabs). For the confocal fluorescence excitation two excitation bands (centred 
at 467 nm and 543 nm) from a single laser system (ALP-710-745-SC, Fianium Ltd) 
are used. We selected these bands using an AOTF (AOTFnc-VIS-TN, AA Opto- 
Electronic). For beam scanning, a fast tip/tilt piezo mirror (S-334.1SD, Physik 
Instrumente GmbH & Co, scan rate up to 200 Hz) is used. For detection, fibre- 
coupled APDs (APDs SPCM-AQRH-14-FC, fibres SPCM-QC9, Perkin Elmer) 
are used. The multimode fibres (621m diameter) serve as pinholes that provide 
the background rejection. 

Quadruple-trap optical tweezers with wide field fluorescence. The setup used 
for dual DNA manipulation using quadruple-trap optical tweezers has been 
demonstrated previously”. In brief, four optical traps are created using a 20 W 
1064nm CW fibre laser (YLR-20-LP-IPG, IPG Photonics) with a coupled optical 
isolator. The laser beam is split in two paths by a polarizing beam splitter after 
which they are expanded using lenses with focal lengths of 75mm and 150mm. 
Both beams are split again, one by a 50/50 beam splitter and one by a polarizing 
beam splitter after the polarization is rotated 45° by a half-wave plate, creating four 
optical paths of which three have the same polarization. The 150 mm lenses are 
placed on a linear stage (AG-LS25, Newport) to modify the collimation and with 
that the height of the traps, enabling manipulation of 2 of the traps in 3 dimensions. 
Beam steering of three traps is done by course-positioning piezo stepper mirrors 
(AG-M100N) and one is steered by an accurate piezo mirror (Nano-MTA2X 
Aluminium, Mad City Labs). The four optical paths are recombined into two 
paths using a 50/50 beam splitter cube (in this way, 50% of the power is lost) 
and a polarizing beam-splitter cube, after which these two paths are recombined 
using 50/50 beam splitter cube (here, again 50% of the laser power is lost). Two 
300 mm lenses are used to couple the beam into the objective. Force detection on 
two trapped microspheres is achieved on two position sensitive detectors (PSDs, 
DL100-7PCBA3, Pacific Silicon Sensor) by back focal plane interferometry. 
To detect the force on the first trapped microsphere, three paths with the same 
polarization are separated and dumped by a polarizing beam splitter. The light 
from the trapped microsphere with the unique polarization is then detected on 
one of the PSDs enabling force detection. To detect the force on the second trapped 
microsphere, the optical path of a 980 nm laser (IQ2C140, Power Technology) is 
then overlapped with the path of one of the three traps with identical polarization. 
Force detection is achieved by detecting the 980 nm light on a second PSD. 
Microsphere-to-microsphere distances are measured by template-matching on 
images created by LED on a complementary metal-oxide semiconductor (CMOS) 
camera (DCC1545M, Thorlabs). Fluorescence excitation is performed in an epi-, 
wide-field illumination by a 491 nm laser (Calypso 50, Cobolt), a 532 nm laser 
(Compass 215M-20, Coherent) and a 635 nm laser (IQ1C10(635-12B), Power 
Technology). The emitted fluorescence signal is detected after a band-pass filter 
and imaged on an EMCCD camera (Cascade 512 B, Princeton Instruments). 
DNA, proteins and buffers. End-biotinylated bacteriophage \ DNA was con- 
nected to streptavidin-coated polystyrene beads to generate the DNA constructs as 
described previously”’. For the experiments with DNA blunt ends, the restriction 
enzyme Pmel (NEB) was used to cleave the DNA and generate blunt ends. 


The streptavidin microspheres (4.5 1m diameter, Spherotech) were attached to the 
ends of the DNA using laminar flow”’. 

All recombinant proteins have been expressed and purified from Escherichia 
coli cells. Biochemical analysis of the XRCC4 and XLF variants are described in 
Extended Data Figs 1-3 (expression vectors are available upon request) using 
previously established procedures'***, Concentrations indicated throughout the 
text are the injected concentrations. Because a fraction of the protein is depleted 
from the solution as it sticks to the plastic and glass in the flow-system (u-Flux, 
LUMICKS B.V.), these are considered upper limits of the actual concentration 
present during measurements. 

XRCC4 fluorescent labelling was performed with maleimide Alexa Fluor 
555 at position 93 or position 218. XLF fluorescent labelling was performed with 
maleimide ATTO 647N at position 148 or with eGFP. For all protein samples 
used, protein activity was similar to wild-type activity. This was confirmed 
by biochemical DNA bridging assays (Extended Data Figs 1-3). Mass spectrometry 
analysis was performed in order to quantify the degree of labelling of the 
proteins (Extended Data Figs 1-3). In single-molecule assays, no difference 
was detected between the two different XRCC4 labelling positions (position 93 
or 218). 

Mass analyses were performed on a MALDI-TOF-TOF Bruker Ultraflex II 
spectrometer (Bruker Daltonics) controlled by the Flexcontrol 3.0 package (Build 
51). This instrument was used at a maximum accelerating potential of 25kV and 
was operated in linear mode and the m/z range from 20,000 to 80,000. The laser 
frequency was fixed to 100 Hz and approximately 1,000 shots by sample were 
cumulated. Five external standards (Protein Calibration Standard II, Bruker 
Daltonics) were used to calibrate each spectrum to a mass accuracy within 
100 p.p.m. Peak picking was performed with Flexanalysis 3.0 software (Bruker) 
with an adapted analysis method. One microlitre of sample was mixed with 1 il of 
a saturated solution of Sinapinic Acid in an acetonitrile/0.1%TFA mixture (30:70). 
Then 111 was spotted on the target, dried and analysed. 

Protein-DNA interactions and force-stretching experiments were conducted in 
20mM Tris-HCl (pH 7.5), 1 mM DDT and, unless otherwise indicated, at 25 mM 
KCL. 

The restriction enzyme Sfol used for the induction of DSBs was obtained 
commercially from NEB. Experiments were performed in 40 U ml! Sfol in 
CutSmart buffer supplied with the enzyme. 

Protein binding. Binding of XRCC4 and XLF was quantified by quantifying the 
fluorescence. This was done by determining the average amount of photons per 
second and per watt of excitation power. All values were background corrected. 
Note that due to the difference in quantum and instrumental detection efficiency 
of eGFP and Alexa Fluor 555 the absolute amount of binding of XRCC4 (Fig. 2c) 
and XLF (Fig. 2g) cannot be compared directly. 

Protein diffusion. The one-dimensional diffusion of protein complexes along 
the DNA was quantified by tracking the mobile proteins for a large number of 
frames (typically more than 100) and calculating their diffusion coefficient 
(D) by using a mean square displacement analysis (MSD)** (Extended Data 
Fig. 6b). The diffusion coefficient obtained at low salt concentrations (25 mM KCl) 
was 0.73 + 0.09 um? s~! for XLF complexes, 0.68 + 0.09 1m? s~! for XRCC4 
complexes and 0.6 + 0.2,1m? s_! for complexes containing both XRCC4 and 
XLE These values are independent of DNA tension (Extended Data Fig. 6c) and 
complex size (Extended Data Fig. 6d), but increase with increasing salt 
concentration: at 160mM KCl, the diffusion constant was 2.0 + 0.2,1m? s~! for 
XRCC4 complexes. 


22. Laurens, N. et al. Alba shapes the archaeal genome using a delicate balance 
of bridging and stiffening the DNA. Nat. Commun. 3, 1328 (2012). 

23. Gross, P., Farge, G., Peterman, E. J. G. & Wuite, G. J. L. Combining optical 
tweezers, single-molecule fluorescence microscopy, and microfluidics for 
studies of DNA-protein interactions. Methods Enzymol. 475, 427-453 
(2010). 

24. Andres, S. N., Modesti, M., Tsai, C. J., Chu, G. & Junop, M. S. Crystal structure 
of human XLF: a twist in nonhomologous DNA end-joining. Mol. Cell 28, 
1093-1101 (2007). 

25. Heller, |. et al. Mobility analysis of super-resolved proteins on optically 
stretched DNA: comparing imaging techniques and parameters. 
ChemPhysChem 15, 727-733 (2014). 
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Extended Data Figure 1 | See next page for caption. 
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Extended Data Figure 1 | Generation of human XLF ATTO-647N- 
labelled fluorescent variant by the ‘Cys light’ method. a, Domain 
architecture of the XLF polypeptide and amino acid sequences of the 
wild-type and Cys148 variant obtained by site-directed mutagenesis. All 
Cys residues were changed to Ser except Cys148, leaving a single solvent 
accessible Cys residue. Three-dimensional model of the XLF dimer where 
the position-specific labelling sites are indicated by the black arrows. 

b, Denaturing and reducing polyacrylamide gel electrophoresis of XLF 
Cys148 variant after labelling with ATTO 647N maleimide. Left, bright- 


field image of the gel before staining. Centre, the emission of ATTO 647N. 


Right, is an image of the gel after staining with Coomassie. M, molecular 
mass markers. c, Mass spectrometry of the labelled full-length protein. 
Top, the spectra of XLF Cys148 (red) after labelling with ATTO 647N 


and compared to the wild-type unlabelled protein (black) giving 
Am=790 Da. Bottom, the calculation of the degree of labelling (D.O.L.), 
as the relative integrated intensity of the labelled protein (dark grey area) 
versus the unlabelled protein (light grey area) signals, giving 0.82 dye per 
monomer. d, DNA bridging activity of the XLF variants. Left, a scheme of 
the bridging assay in which an end-biotinylated 1,000-bp DNA fragment 
is coupled to streptavidin-coated magnetic beads. Protein-mediated DNA 
bridging is assessed by recovery of an unlabelled 500-bp DNA fragment. 
Right, image of an agarose gel stained with ethidium bromide to detect 
the bridged 500-bp DNA fragment (indicated by the black arrow) and the 
corresponding quantification. Asterisk indicates biotinylated 1,000-bp 
DNA fragment adsorbed non-specifically on the surface of the magnetic 
beads. 
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Extended Data Figure 2 | Generation of human XLF fluorescent 
variants by eGFP tagging. a, Domain architecture of the XLF polypeptide 
and amino acid sequences of the C- and N-terminal eGFP fusions. 

b, Denaturing and reducing polyacrylamide gel electrophoresis of the 
purified eGFP fusions. c, DNA bridging activity of the XLF variants. 

Top, a scheme of the bridging assay where an end-biotinylated 1,000-bp 


DNA fragment is coupled to streptavidin-coated magnetic beads. Protein- 
mediated DNA bridging is assessed by recovery of an unlabelled 500-bp 
DNA fragment. Bottom, an image of an agarose gel stained with ethidium 
bromide to detect the bridged 500-bp DNA fragment (indicated by the 
black arrow). Asterisk indicates biotinylated 1,000-bp DNA fragment 
adsorbed non-specifically on the surface of the magnetic beads. 
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Extended Data Figure 3 | Generation of human XRCC4 fluorescent 
variants by the ‘Cys light’ method. a, Domain architecture of the XRCC4 
polypeptide and amino acid sequences of the wild-type, C93 and C218 
variants obtained by site-directed mutagenesis. All Cys residues were 
changed to Ala except, respectively C93 and C218, leaving for each variant 
a single solvent accessible Cys residue. Three-dimensional models of the 
XRCC4 dimer where the position-specific labelling sites are indicated 

by the black arrows. b, Denaturing and reducing polyacrylamide gel 
electrophoresis of XRCC4-C93 and -C218 variants after labelling with 
Alexa Fluor 555 maleimide. Left, bright-field image of the gel before 
staining. Centre, the emission of Alexa Fluor 555. Right, an image of the 
gel after staining with Coomassie. c, Mass spectrometry of the labelled 
full-length proteins. Top, the spectra of XRCC4-C93 (orange) and 
XRCC4-C218 (green) after labelling with Alexa Fluor 555 and compared 
to the wild-type unlabelled protein (black), giving Am = 830 Da and 


Am = 835 Da for labelled XRCC4-C93 and XRCC4-C218, respectively. 
Bottom, the calculation of the degree of labelling, as the relative integrated 
intensity of the labelled protein (dark grey area) versus the unlabelled 
protein (light grey area) signals, giving 0.91 and 0.94 dye per monomer 
for XRCC4-C93 and XRCC4-C218, respectively. d, DNA bridging activity 
of the XRCC4 variants. Top, a scheme of the bridging assay in which an 
end-biotinylated 1,000-bp DNA fragment is coupled to streptavidin- 
coated magnetic beads. Protein-mediated DNA bridging is assessed 

by recovery of an unlabelled 500-bp DNA fragment. Bottom, image of 
agarose gel stained with ethidium bromide to detect the bridged 500-bp 
DNA fragment (indicated by the black arrow) and the corresponding 
quantification (the image is from the same gel as in Extended Data Fig. 1d 
but only the lanes relevant to the analysis of fluorescently labelled XRCC4 
are shown). Asterisk indicates biotinylated 1,000-bp DNA fragment 
adsorbed non-specifically on the surface of the magnetic beads. 
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Extended Data Figure 4 | Quantification of XRCC4 and XLF binding 
to dsDNA. a, Fluorescence image of an overstretched dsDNA molecule in 
the presence of 100nM eGFP-XLE. Under the given salt conditions and at 
this DNA extension, significant DNA melting and formation of ssDNA is 
expected. XLF shows a high affinity to bind to dsDNA (bright fluorescent 
signal), whereas it does not appear to bind to ssDNA (dark region). 

b, c, Size of the DNA-bound protein complexes for XRCC4 (b) and XLF (c) 
as determined from their fluorescence intensities (after incubation with 
50nM XRCC4 (b) or XLF (c)). d, Sections of a kymograph measured in 
the protein channel (25 nM eGFP-XLF). Short protein-binding events are 
visible as brief, local bursts of fluorescence. Events are shorter than the 
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line scan time (10 ms). e, Typical intensity time traces of bound XRCC4 
and XLF indicate that the complexes bind as a whole from solution and 
not monomer by monomer. The decrease of the fluorescence intensity 

in time is due to photobleaching of the fluorophores. f, Two successive 
kymographs (separated by white line). Two separate kymographs are 
recorded owing to technical limitations on the maximum recoding time 
of XRCC4-Alexa Fluor 555 binding measured using a very low excitation 
power to reduce the effect of photobleaching. Under these conditions, 
XRCC4 oligomers stay bound for long periods of time (in the order of 
several minutes). Scale bars, 5s and 21m. Data are representative examples 
of 7 (a), 13 (d) and 5 (f) experiments. 
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Extended Data Figure 5 | Properties of protein bridges. a, b, Bridging 
also occurs in the absence of free protein in solution. Fluorescence images 
(XRCC4-Alexa Fluor 555 and XLF-ATTO 647N fluorescence in a, and 
XRCC4-Alexa Fluor 555 fluorescence in b) acquired in the absence of 

free protein in solution of two XRCC4-XLF-coated dsDNA molecules 
before (a) and after (b) wrapping and subsequent unwrapping shows 

that bridging does not require the presence of free protein in solution. 
Representative example out of 2 experiments. c, d, DNA-protein bridges 
also occur in the presence of either XRCC4 or XLF. Fluorescence images of 
bridges formed by wrapping and subsequent unwrapping in the presence 


of XRCC4-Alexa Fluor 555 (c) or eGFP-XLF (d). Representative example 
out of 10 (c) or 19 (d) experiments. e, f, In ~5% of the cases, rupture of 
protein bridges resulted into two intact dsDNA molecules bound with 
XRCC4-Alexa Fluor 555 and XLF-ATTO 647N. Fluorescence images 
before (e) and after (f) rupture of such a protein bridge. Representative 
example out of 9 experiments. g, h, Fluorescence images taken of an 
XRCC4-XLF bridge at t=0 min (g) and t= 95 min (h), showing that 

the bridge and the protein complexes remain stably bound to the DNA 
segments. Green, XRCC4; red, XLF, yellow, colocalization. Scale bars, 
2m. 
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Extended Data Figure 6 | Quantification of diffusion behaviour of 
XRCC4 and XLF on dsDNA. a, Typical kymographs showing switching 
of XLF-ATTO 647N (red signal, top) XRCC4—Alexa Fluor 555 (green 
signal, middle) and XLF-XRCC4 (yellow signal, bottom) complexes 
between static and diffusive states. Scale bars, 1 s and 21m. b, Typical 
MSD curve of an individual XRCC4-Alexa Fluor 555 complex. Inset, 
kymograph of corresponding complex. Red line denotes linear fit 

to the first three data points; from the slope, a diffusion coefficient of 
0.51 41m s~! is determined. See Methods for details on MSD analysis. 

c, Diffusion coefficients of mobile XRCC4 and XLF complexes at different 
salt conditions (low salt: 25 mM KCL, high salt: 160 mM KCl) and DNA 
tensions. d, Diffusion coefficients of mobile XRCC4 protein complexes 


LETTER 


Gd xRcc4 
sy XLF 


[9 XRCC4+XLF 


as a function of complex size. Grey circles, individual data points; black 
circles, average of 8 successive data points. e, Comparison of the observed 
diffusion coefficients of XRCC4 (green), XLF (red) and XRCC4-XLF 
complexes (yellow) to the expected diffusion coefficients based on the 
helical diffusion model’? (solid black line). To calculate the quantity on 
the horizontal axis, a hydrodynamic radius of 7-22 nm was used, based 
on the inner and outer radii of the XRCC4-XLF filament as proposed 
previously. f, Average observed dwell times of protein complexes before 
switching to a different mode. The analysis is performed on 121 events. 
The shortest dwell time that could be determined with certainty was 1s. 
All error bars denote s.e.m. 
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Extended Data Figure 7 | Rupture events in force-extension curves can 
be caused by DNA-protein bridges or nonspecific sticking of protein- 
bound DNA to the trapped microspheres. a, c, Force-extension curves 
of dsDNA-~XRCC4-XLF complexes (red) after incubation at low tension 
shows rupture events. Black data sets show force-extension curve of bare 
dsDNA. b, A single, continuous fluorescence kymograph corresponding 


17 


Position 


to red curve in a. At the indicated time (orange arrow in a and b) 

a protein bridge suddenly ruptures into multiple smaller protein 
complexes. d, Fluorescence kymograph corresponding to red curve in c 
reveals that rupture event corresponds to the rupture of a non-specific 


interaction between a DNA-bound protein complex and the polystyrene 
microsphere. Scale bars, 1s and 21m. 
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a b C | 
Extended Data Figure 8 | Schematic representation of dual-DNA configurations used in quadruple-trap experiments. a—c, Wrapped (a), unwrapped (b) 
and crossed (c) DNA configurations. 
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Extended Data Figure 9 | Quantification of DNA bridging by XRCC4-XLE. a, b, Normalized length of DNA segments over time during experiments 


such as shown in Figs 2 and 3. a, The static bridge in the experiment described in Fig. 2 and Supplementary Video 1. b, The mobile bridge in the 
experiment described in Fig. 3 and Supplementary Video 2. 
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Extended Data Figure 10 | Protein bridges always contain both XRCC4 and XLF. a, Typical examples of XRCC4-XLF bridges formed by incubating 
two wrapped DNA molecules in 200 nM eGFP-XLF and 200 nM XRCC4-Alexa Fluor 555 for 2 min. In all 152 bridges that were analysed, clear 
colocalization of the proteins at the junction is observed. 
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Dynamics of ribosome scanning and recycling 
revealed by translation complex profiling 


Stuart K. Archer!?*, Nikolay E. Shirokikh!3*, Traude H. Beilharz* & Thomas Preiss° 


Regulation of messenger RNA translation is central to eukaryotic 
gene expression control’. Regulatory inputs are specified by the 
mRNA untranslated regions (UTRs) and often target translation 
initiation. Initiation involves binding of the 40S ribosomal small 
subunit (SSU) and associated eukaryotic initiation factors (eIFs) 
near the mRNA 5’ cap; the SSU then scans in the 3’ direction 
until it detects the start codon and is joined by the 60S ribosomal 
large subunit (LSU)?> to form the 80S ribosome. Scanning and 
other dynamic aspects of the initiation model have remained as 
conjectures because methods to trap early intermediates were 
lacking. Here we uncover the dynamics of the complete translation 
cycle in live yeast cells using translation complex profile sequencing 
(TCP-seq), a method developed from the ribosome profiling® 
approach. We document scanning by observing SSU footprints 
along 5’ UTRs. Scanning SSU have 5’-extended footprints (up to 
~75 nucleotides), indicative of additional interactions with mRNA 
emerging from the exit channel, promoting forward movement. 
We visualized changes in initiation complex conformation as SSU 
footprints coalesced into three major sizes at start codons (19, 29 
and 37 nucleotides). These share the same 5’ start site but differ at 
the 3’ end, reflecting successive changes at the entry channel from 
an open to a closed state following start codon recognition. We 
also observe SSU ‘lingering’ at stop codons after LSU departure. 
Our results underpin mechanistic models of translation initiation 
and termination, built on decades of biochemical and structural 
investigation, with direct genome-wide in vivo evidence. Our 
approach captures ribosomal complexes at all phases of translation 
and will aid in studying translation dynamics in diverse cellular 
contexts. Dysregulation of translation is common in disease and, for 
example, SSU scanning is a target of anti-cancer drug development’. 
TCP-seq will prove useful in discerning differences in mRNA- 
specific initiation in pathologies and their response to treatment. 

Protein synthesis in eukaryotes is more complex than in prokaryotes, 
particularly at the highly regulated initiation step, in which eukaryote- 
specific mechanisms of ribosome attachment to mRNA and start codon 
recognition are used. Although multiple biochemical and structural 
studies support the scanning model of initiation, originally proposed 
by Marilyn Kozak almost forty years ago’, there still is no direct in vivo 
evidence” and alternative mechanisms have been proposed’. Several 
issues remain difficult to resolve, such as whether scanning occurs ina 
cap-severed or cap-tethered manner; what mechanism facilitates direc- 
tional movement of scanning SSU; what intermediates are adopted at 
particular stages; and what is the rate-limiting step of initiation in vivo 
(Extended Data Fig. 1 and Supplementary Discussion). Information on 
the position of translation complexes along mRNA is needed to address 
these questions. Ribosome profiling® maps elongating ribosomes along 
mRNA in vivo, however, it cannot record intermediates of translation 
initiation, termination and recycling. 


To address this, we developed translation complex profile sequenc- 
ing (TCP-seq; Fig. la and Supplementary Discussion). We crosslinked 
snap-chilled yeast (Saccharomyces cerevisiae) cells with formaldehyde, 
which efficiently fixes intermediates of ribosome function”, to stall 
and attach any translation complex type to mRNA at their native posi- 
tions. Next, polysomal translation complexes were isolated and digested 
with RNase I to generate protected mRNA fragments (footprints), as 
in ribosome profiling. However, ribosome and SSU fractions were 
then separated by sedimentation velocity before retrieval of footprints 
and high-throughput sequencing. Library inserts up to 250 nucleo- 
tides (nt) were selected to accommodate diverse footprint sizes and 
high-throughput sequencing reads (150 nt) were mapped to the yeast 
genome (Supplementary Table 1). Unfractionated polysomal mRNA 
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Figure 1 | Translation complex profile sequencing (TCP-seq) surveys 
all phases of mRNA translation. a, Outline of TCP-seq. See Methods and 
Supplementary Discussion for details. RS, ribosomes. b, Sedimentation 
profiles of ribosomes and ribosomal subunits after RNase cleavage of 
polysomes from wild-type (WT) and SYOI-TAP (SYO1) yeast strains. 
Shading indicates material used for high-throughput sequencing library 
preparation. Inserts show magnified (10) SSU peaks. AU, arbitrary units. 
c, Distribution of mapped reads across RNA types from SSU and ribosome 
libraries (see Supplementary Table 1 for raw read numbers). Note that 
RNase digestion and broad library insert size selection during TCP-seq 
pose challenges for rRNA depletion (Supplementary Discussion). d, tRNA 
distribution between SSU and ribosome libraries. Averaged values from 
wild-type and SYO1-TAP libraries are shown. Initiator tRNAM* (red), 
elongator tRNA.“ (green), and other tRNAs (blue). 
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Figure 2 | Snapshots of SSU and ribosome footprint distribution 
reveal translation initiation by a cap-severed, SSU-led scanning 
mechanism and staged ribosome recycling after termination. Graphs 
depict cumulative mRNA data transcriptome-wide from wild-type cells 
(see Extended Data Fig. 5 for the SYO1 strain results). Footprint (FP) 5’ 
ends are plotted versus footprint length; colour scale represents footprint 
count for each combination of footprint 5’ ends and footprint length (see 
Extended Data Fig. 6 for footprint 3’ end plots). a, Footprint distribution 
over the start codon region. All mRNAs are aligned by the first nucleotide 
(position 0) of the annotated ORF start codon. b, Footprint distribution 
over the stop codon region. As in a, but mRNAs are aligned by the first 


was processed alongside as a control (total input). Steady-state pol- 
ysomes mostly contain elongating ribosomes, with few initiating SSU. 
Accordingly, the sedimentation profiles featured a major ribosome- 
related peak and a minor SSU-related peak, the latter increased in 
mildly LSU-deficient SYO1-TAP (TAP, tandem-affinity purifica- 
tion) cells!! (Fig. 1b). Libraries were sequenced from wild-type and 
SYO1-TAP cells. The latter replicated all qualitative findings, while 
displaying the expected quantitative differences due to LSU defi- 
ciency, as described below. Regarding the specificity of ribosome 
and SSU purification, SSU libraries contained mostly SSU ribosomal 
RNA, whereas ribosome libraries harboured LSU and SSU rRNA ina 
~3:1 ratio (Fig. 1c). Furthermore, initiator transfer RNA tRNA 
was >60-fold enriched in SSU compared to ribosome libraries 
(Fig. 1d; Supplementary Table 1). Notably, ribosome footprint lengths, 
positions of footprint 5’ and 3/ ends relative to internal ribosome 
topology, and ribosome footprint distribution across open reading 
frames (ORFs) were all consistent with previous ribosomal profil- 
ing experiments”)? (Fig. 2a, b, top; Extended Data Figs 2 and 3a-c 
and Supplementary Discussion). No substantial sequencing bias was 
detected across the footprints (Extended Data Fig. 4). Altogether, this 
validates the TCP-seq approach. 
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nucleotide (position 0) of the annotated ORF stop codon. ¢, Footprint 
distribution near mRNA cap structures. Only mRNAs with 5’ UTRs longer 
than 80 nt are included. d, Start codon focused plots of SSU footprint 
length versus footprint 5’ (left) or footprint 3’ (right) end positions on 
mRNAs with annotated 5’ UTRs longer than 100 nt. Dashed lines indicate 
three distinct classes of SSU footprints with shared footprint 3’ ends and 
variable footprint 5’ ends, as depicted in the schematic (more details and 
quantification are in Extended Data Fig. 6e, f). Here the ratio of footprint 
5/ end density in a representative —70 nt to —50 nt window, to that on the 
start codon, is 1:21 (or 1:46 if footprints with 5’ ends between —16 nt and 
—10 nt are all included in the start codon tally). 


Unlike ribosome footprints, SSU footprints were prominent in 5’ 
UTRs and enriched at start codons, with some accumulation also near 
stop codons (see below) and negligible presence in 3’ UTRs (Fig. 2a, b, 
bottom; Extended Data Figs 2a-d, f, 5a, b and 6a—d). SSU footprints 
within ORFs possessed characteristics indicating that they are due 
to disassembly of a small fraction (~1%, as determined by sedimen- 
tation profiles, Fig. 1b) of elongating ribosomes during purification 
(Extended Data Fig. 3e, f and Supplementary Discussion), reflected 
also in a minor peak of LSU visible in Fig. 1b. The ratio of 5’ UTR 
to start codon footprint density was maximally 1:46 when aggregated 
across mRNAs having annotated 5’ UTRs long enough to unambigu- 
ously observe 5’ UTR footprints (see Fig. 2 legend and Supplementary 
Discussion). As expected, SYO1-TAP cells exhibited a similar SSU 
distribution to wild-type cells, but with stronger concentrations at start 
codons, consistent with delayed LSU joining in this strain (Extended 
Data Figs 2f and 5a-d). There was no increase in footprint density 
near the 5’ end of mRNAs (Fig. 2c; Extended Data Figs 2b, 7 and 8 for 
individual mRNAs); however, longer 5’ UTRs harboured more SSU 
on average (Extended Data Fig. 9a) without decreased SSU frequency 
at start codons (Extended Data Fig. 9b). There were no well-expressed 
mRNAs without SSU footprints in the 5’ UTR (Extended Data Fig. 9c). 
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Figure 3 | TCP-seq can monitor modes and efficiency of translation 
initiation for individual mRNAs. a, b, Aggregated footprint coverage 

for wild-type (black) or SYO1 (grey) cells versus 5’ UTR nucleotide 
position (aligned as in Fig. 2a). Names of individual mRNAs are indicated. 
Coverage is normalized to mRNA library size by the combined counts 

of 5‘ UTR and start codon footprints for each mRNA and then scaled by 
the maximum value. Region of the PABI mRNA sequence enriched for 
adenine nucleotides (a, bottom-left plot) and positions of the translated 
non-AUG (nAu) and canonical uORFs of the GCN4 mRNA (b) are 


These observations substantiate a model of cap-severed scanning 
(as opposed to cap-tethered scanning; Extended Data Fig. 1c) whereby 
SSU vacate 5’-end proximal regions of 5’ UTRs at a similar rate as they 
move along internal 5’ UTR regions, and multiple SSU can simultane- 
ously scan a 5’ UTR. 

Remarkably, 5’ UTR SSU footprint sizes ranged evenly from ~19 
to ~75 nt (Fig. 2 and Extended Data Figs 2e and 6e), well beyond 
the width of SSU alone and reminiscent of in vitro observations in 
which edeine-enforced start codon bypass resulted in broader mRNA 
protection’. To assess where SSU reside within such extended scanning 
complexes, we inspected footprint patterns on approach to the start 
codon. There, complexes were uniformly arrested with their leading 
edges at one of three major positions (Fig. 2d, right; Extended Data 
Fig. 6e), consistent with the expected entry channel configurations of 
initiating SSU (see later for analysis of footprint 3’ patterns at start 
codons). Concomitantly, footprint 5’ density for extended complexes 
began to decline from ~40 nt upstream of start codons (Fig. 2d, left). 
This pattern indicates that the extended length of scanning SSU foot- 
prints is due to additional mRNA contacts upstream, but not down- 
stream of SSU (Fig. 2d and Extended Data Fig. 6f), consistent with an 
earlier hypothesis that mRNA-binding by eIF4A, eIF4B/H and eIF4F 
prevents SSU from back-sliding'*’° (Supplementary Discussion; 
Extended Data Figs 1d and 10). Intriguingly, we also noted a minor 
presence of the 5’-extended footprints among start codon-associated 
ribosomes, suggesting some carryover of the scanning elFs into early 
elongation as postulated based on observations with reinitiation after 
translation of short ORFs'° (Fig. 2a and Extended Data Figs 5a and 
6a, c). Collectively, these observations favour an SSU-led, or ‘pushed; 
model of scanning, whereby additional interactions preferentially occur 
on mRNA behind the SSU, enforcing movement in the 5/— 3! direction, 
as opposed to a ‘pulled’ SSU model, in which such contacts would occur 
in front of the SSU (Extended Data Figs 1d and 10a). 

SSU footprints were further analysed for individual mRNAs with 
sufficient read depth. Multiple mRNAs with seemingly unstructured’’ 
and relatively short 5’ UTRs featured a smooth distribution of scanning 
SSU throughout their 5’ UTRs (including ACT and SSC1 in Fig. 3a; 
Extended Data Fig. 7), consistent with a linear, constant-rate scanning 
motion (Extended Data Fig. 1b); others displayed evidence of scan- 
ning block (Extended Data Fig. 8; Supplementary Discussion). This 
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indicated by amber bars. For the complete footprint maps of these and 
other mRNAs, see Extended Data Figs 7 and 8, and the web interface for 
TCP-seq data display (http://bioapps.erc.monash.edu/TCP/). c, mRNAs 
were ranked by the ratio of peak SSU footprint coverage in the 5’ UTR, to 
sum of peak SSU footprint coverages in the 5’ UTR and at the start codon. 
Positions of individual mRNAs shown in Extended Data Figs 7 and 8 are 
indicated on the plot and colour-coded to reflect higher peaks on start 
codon (red) or in 5’ UTR (blue). See Supplementary Table 2 for a full list of 
ranked mRNAs. 


was most common with longer 5’ UTRs and often associated with short 
upstream ORFs (uORFs) at which the SSU accumulated (for example, 
GCN4 in Fig. 3b). However, in many cases 5’ UTR SSU clustering could 
not be attributed to uORFs or apparent RNA structure’, and these 
probably represent initiation control governed by specific factor bind- 
ing (as can be expected for the A-rich sequence of PAB] mRNA"; 
Fig. 3a; Extended Data Fig. 8b and Supplementary Discussion) or the 
nucleotide context of the 5’ UTR itself. Overall, the full range of 5’ 
UTR to start codon ratios of peak coverage was observed (Fig. 3c), with 
some over-representation on the lower band due to the prevalence of 
5’ UTRs that are too short for unambiguous detection of 5’ UTR SSU 
footprints. This indicates prevalent use of 5’ UTR features to ‘tune’ 
scanning, and thus initiation efficiency, across the yeast transcriptome. 
The set of mRNAs ranking highly in the ratio of 5’ UTR to start codon 
peak coverage is rich in both known examples of, and new candidates 
for, regulatory 5’ UTR elements (for example, SEC4 mRNA in Fig. 3a; 
see also Supplementary Discussion, Supplementary Table 2, and the 
web interface for browsing and analysing the TCP-seq data at http:// 
bioapps.erc.monash.edu/TCP/). Thus, TCP-seq has unique utility for 
surveying and dissecting initiation control mechanisms operating on 
individual mRNAs. 

Start-codon-associated SSU footprints exhibited three main lengths 
(19, 29 and 37 nt; Fig. 2a bottom; Extended Data Figs 2e, 5a, 6a, c 
and 10a), mainly due to diversity in mRNA protection at the entry to 
the mRNA channel, with mostly uniform protection at the exit side 
(footprint 5’ at —12 nt, footprint 3’ at +6, +16 and +24 nt; Fig. 4a 
and Extended Data Fig. 5c). Recent structural studies can help inter- 
pret this pattern (Extended Data Fig. 10), with the caveat that in vivo 
initiation intermediates might not completely match the composition 
and/or structural configuration of in vitro assembled complexes. SSU 
footprints spanning from —12 nt to +6 nt (19 nt in length) conceivably 
derive from paused SSU still in open, scanning-competent configura- 
tion!?. SSU footprints extending to +16 nt would represent SSU in the 
closed state following AUG recognition, protecting 29 nt of mRNA. 
These SSU are expected to feature disengaged eIF1 and the N termi- 
nus of eIF1A structured in the A-site®?! and an entry channel that is 
tightly closed on mRNA”, with the eIF1A carboxy terminus bound 
to eIF5 (refs 20, 22). Finally, footprints ending at +24 nt could reflect 
a third SSU state just before LSU joining, where eIF5B:GTP attaches 
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at the entry of the mRNA channel, further extending protection™>*. 
Additional support for placement of SSU footprints in such a temporal 
sequence was derived from grouping them based on their 3’ ends and 
inspecting extended 5’ ends. Footprints ending around +6 nt included 
a minor population with footprint 5’ ends at up to —60 nt, suggesting 
that additional interactions characteristic of scanning are still partly in 
place (Fig. 4b, top). Footprints ending at +16 and +24 nt increasingly 
developed a second population with footprint 5’ ends around —30 nt, 
possibly the result of additional scanning SSU ‘queuing’ and further 
extending protection upstream by ~19 nt, due to cap recruitment 
and scanning outpacing start codon clearance (Fig. 4b, middle and 
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Figure 4 | Changes in SSU footprint size reveal dynamic rearrangements 
at start and stop codons. a, Aggregated profiles of SSU footprint 5’ 

end (red) and footprint 3’ end (blue) frequencies are plotted against 
mRNA position, and aligned by the first nucleotide in the P-site of start 
codon-arrested SSU (first nucleotide of the start codon) set at position 

0. Footprint frequencies are scaled by the maximum values and peak 
frequency positions indicated. The error regions shown as shading of the 
same colour indicate the 95th percentiles of 1,000 iterations of random 
gene-wise resampling across all 5,859 represented genes (see Methods for 
details). Data shown are from all annotated mRNAs in wild-type cells; see 
Extended Data Fig. 5c, d for results with SYO1. b, Left, relative frequencies 
of the footprint 5’ ends corresponding to the +6 (top), +16 (middle) and 
+24 nt (bottom) major footprint 3’ end types plotted against position on 
mRNA, aligned as in a. Footprint 3’ end selection windows are indicated. 
Only mRNAs with annotated 5’ UTRs longer than 50 nt are included. 
Error bars depicted in black are calculated as in a. Right, schematics 
summarizing footprint 5’ and 3’ ends for the major SSU footprint types 
observed in the corresponding left panels. The proposed cause for the 
different 5’-extended footprint protection is also indicated (see main text 
and Extended Data Fig. 10 for details). c, Same as a, except the alignment 
is by first nucleotide in the P-site of terminating SSU (first nucleotide of 
the last sense codon). d, Left, same as b, but for the +8 (top) and +16 nt 
(bottom) footprint 3’ clusters of terminating SSU. These two footprint 3’ 
end peaks were most apparent in the footprint subset having footprint 5’ 
end localization characteristic of terminating complexes (Extended Data 
Figs 3d and 5e). Right, same as b, but for the stop-codon-associated SSU 
(UAA represents any of the three annotated stop codons). 


bottom), although other explanations are also plausible. The shape of 
the SSU-related sedimentation peak (Fig. 1b) is also consistent with 
such complex heterogeneity. SSU footprints with all three footprint 3’ 
end positions are discernible on multiple individual mRNAs (Fig. 3a, b 
and Extended Data Figs 7, 8 and 9d-f), albeit in varying proportions 
probably indicative of an influence of local mRNA context on start 
codon recognition. These analyses serve to underpin models derived 
from biochemical and structural data, regarding the temporal sequence 
and molecular nature of rearrangements within SSU complexes at start 
codons, with direct in vivo evidence. 

Finally, enrichment of SSU footprints of two main lengths (29 and 
21 nt) is also observed on the last sense codon of annotated ORFs 
(Fig. 2b bottom; Extended Data Figs 2a, c, e, 5b, d and 6b, d, bottom 
panels). They share one footprint 5’ end at —12 nt, with footprint 3’ ends 
at +8 nt (minor) and +16 nt (major) (Fig. 4c; positions given relative 
to the first nucleotide in the P-site codon; see Extended Data Figs 3d 
and 5e), which is consistent with the increased presence of 3’-receded 
ribosome footprints (Extended Data Figs 3a, b and 5b, d). These shorter 
SSU and ribosome footprints conceivably reflect ribosome recycling 
intermediates with a more ‘oper’ entry channel configuration, poten- 
tially resulting from ABCE] activity at the entry site?> (Extended 
Data Fig. 10b). A small fraction of terminating SSU in wild-type 
cells again feature footprints with 5’-extended protection (Fig. 2b). It 
is tempting to speculate that this reflects interactions of ABCE1 with 
elF2, elF5 (ref. 26) and elF3 (ref. 27), and eRF3 with Pab1”%, potentially 
conferring these scanning factors on the expended SSU early during 
recycling to expedite its transfer for re-initiation in cis through the 
closed-loop mRNA structure formed by 5’ cap-eIF4E-eIF4G-Pab1-3' 
poly(A) interactions. As a major conclusion, these findings provide 
direct evidence of subunit splitting before mRNA dissociation”? for 
at least a considerable proportion of termination events, rather than 
complete ribosome dissociation*®® from mRNA (Extended Data 
Figs le and 10). 

TCP-seq generates in vivo evidence that can be used to address gen- 
eral and mRNA-specific aspects of translational control, particularly 
by 5’ UTR elements. Several entirely new lines of research are now 
within reach. For example, the combination of TCP-seq with existing 
collections of yeast initiation factor mutant strains will refine roles of 
individual factors and substantiate the molecular nature of transla- 
tion complex intermediates. Future studies should apply TCP-seq to 
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monitor translation under different growth conditions and in other 
eukaryotic and prokaryotic organisms. Given the importance of trans- 
lational control to human disease, including cancer’, TCP-seq could 
be used to aid in characterizing pathologic changes in translation and 
in the development of therapeutic approaches. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Received 3 December 2015; accepted 14 June 2016. 
Published online 20 July 2016. 


1. Schwanhdausser, B. et al. Global quantification of mammalian gene expression 
control. Nature 473, 337-342 (2011). 

2. Hinnebusch, A. G. The scanning mechanism of eukaryotic translation initiation. 
Annu. Rev. Biochem. 83, 779-812 (2014). 

3. Jackson, R. J., Hellen, C. U. & Pestova, T. V. The mechanism of eukaryotic 
translation initiation and principles of its regulation. Nature Rev. Mol. Cell Biol. 
11, 113-127 (2010). 

4. Hershey, J. W., Sonenberg, N. & Mathews, M. B. Principles of translational 
control: an overview. Cold Spring Harb. Perspect. Biol. 4, a011528 (2012). 

5. Merrick, W. C. Eukaryotic protein synthesis: still a mystery. J. Biol. Chem. 285, 
21197-21201 (2010). 

6. Ingolia, N. T. Ribosome profiling: new views of translation, from single codons 
to genome scale. Nature Rev. Genet. 15, 205-213 (2014). 

7. Bhat, M. et al. Targeting the translation machinery in cancer. Nature Rev. Drug 
Discov. 14, 261-278 (2015). 

8. Kozak, M. How do eucaryotic ribosomes select initiation regions in messenger 
RNA? Cell 15, 1109-1123 (1978). 

9. Paek, K. Y. et al. Translation initiation mediated by RNA looping. Proc. Natl. 
Acad. Sci. USA 112, 1041-1046 (2015). 

10. Valasek, L., Szamecz, B., Hinnebusch, A. G. & Nielsen, K. H. /n vivo stabilization 
of preinitiation complexes by formaldehyde cross-linking. Methods Enzymol. 
429, 163-183 (2007). 

11. Li, Z. et al. Rational extension of the ribosome biogenesis pathway using 
network-guided genetics. PLoS Biol. 7, €1000213 (2009). 

12. Lareau, L. F, Hite, D. H., Hogan, G. J. & Brown, P. O. Distinct stages of the 
translation elongation cycle revealed by sequencing ribosome-protected 
mRNA fragments. eLife 3, e€01257 (2014). 

13. Kozak, M. & Shatkin, A. J. Migration of 40S ribosomal subunits on messenger 
RNA in the presence of edeine. J. Biol. Chem. 253, 6568-6577 (1978). 

14. Siridechadilok, B., Fraser, C. S., Hall, R. J., Doudna, J. A. & Nogales, E. Structural 
roles for human translation factor elF3 in initiation of protein synthesis. 
Science 310, 1513-1515 (2005). 

15. Spirin, A. S. How does a scanning ribosomal particle move along the 
5’-untranslated region of eukaryotic mRNA? Brownian Ratchet model. 
Biochemistry 48, 10688-10692 (2009). 

16. Poyry, T.A., Kaminski, A. & Jackson, R. J. What determines whether 
mammalian ribosomes resume scanning after translation of a short upstream 
open reading frame? Genes Dev. 18, 62-75 (2004). 

17. Kertesz, M. et al. Genome-wide measurement of RNA secondary structure in 
yeast. Nature 467, 103-107 (2010). 

18. Gilbert, W. V., Zhou, K., Butler, T. K. & Doudna, J. A. Cap-independent translation 
is required for starvation-induced differentiation in yeast. Science 317, 
1224-1227 (2007). 


574 | NATURE | VOL 535 | 28 JULY 2016 


19. Llacer, J. L. et al. Conformational differences between open and closed states of 
the eukaryotic translation initiation complex. Mol. Cell 59, 399-412 (2015). 

20. Saini, A. K. et al. Eukaryotic translation initiation factor elF5 promotes the 
accuracy of start codon recognition by regulating Pi release and 
conformational transitions of the preinitiation complex. Nucleic Acids Res. 42, 
9623-9640 (2014). 

21. Lomakin, |. B. & Steitz, T. A. The initiation of mammalian protein synthesis and 
mRNA scanning mechanism. Nature 500, 307-311 (2013). 

22. Luna, R. E. et al. The C-terminal domain of eukaryotic initiation factor 5 
promotes start codon recognition by its dynamic interplay with elF1 and elF28. 
Cell Rep. 1, 689-702 (2012). 

23. Yamamoto, H. et al. Structure of the mammalian 80S initiation complex with 
initiation factor 5B on HCV-IRES RNA. Nature Struct. Mol. Biol. 21, 721-727 
(2014). 

24. Fernandez, |. S. et al. Molecular architecture of a eukaryotic translational 
initiation complex. Science 342, 1240585 (2013). 

25. Becker, T. et al. Structural basis of highly conserved ribosome recycling in 
eukaryotes and archaea. Nature 482, 501-506 (2012). 

26. Chen, Z. Q. et al. The essential vertebrate ABCE1 protein interacts with 
eukaryotic initiation factors. J. Biol. Chem. 281, 7452-7457 (2006). 

27. Khoshnevis, S. et al. The iron-sulphur protein RNase L inhibitor functions in 
translation termination. EMBO Rep. 11, 214-219 (2010). 

28. Roque, S. et al. Interaction between the poly(A)-binding protein Pab1 and the 
eukaryotic release factor eRF3 regulates translation termination but not mRNA 
decay in Saccharomyces cerevisiae. RNA 21, 124-134 (2015). 

29. Jackson, R. J., Hellen, C. U. & Pestova, T. V. Termination and post-termination 
events in eukaryotic translation. Adv. Protein Chem. Struct. Biol. 86, 45-93 (2012). 

30. Kurata, S. et a/. Possible steps of complete disassembly of post-termination 
complex by yeast eEF3 deduced from inhibition by translocation inhibitors. 
Nucleic Acids Res. 41, 264-276 (2013). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements This work was supported by an ARC Discovery Grant 
(DP1300101928) and an NHMRC Senior Research Fellowship (514904) 
awarded to T.P. N.E.S. was supported by a Go8 European Fellowship. We are 
grateful to A. G. Hinnebusch, C. G. Proud and R. D. Hannan for discussions 

and suggestions for this work. We acknowledge technical support from the 
Australian Cancer Research Foundation Biomolecular Resource Facility JCSMR, 
ANU), D. Powell and S. Androulakis at the Monash Bioinformatics Platform. 


Author Contributions S.K.A., T.H.B. and T.P. designed the research, S.K.A. and 
N.E.S. performed the experiments, S.K.A, N.E.S., T.H.B. and T.P. analysed the 
data, discussed the results and wrote the paper. 


Author Information The sequences of reads with poly(A) tracts were deposited 
to the Sequence Read Archive, under accession code SRPO74093. An online 
interface for browsing and analysing the TCP-seq data is available at (http:// 
bioapps.erc.monash.edu/TCP/) along with (http://dx.doi.org/10.6084/ 
m9.figshare.3206725). The underlying mapped dataset is also available 
(http://dx.doi.org/10.6084/m9.figshare.3206698). Reprints and permissions 
information is available at www.nature.com/reprints. The authors declare no 
competing financial interests. Readers are welcome to comment on the online 
version of the paper. Correspondence and requests for materials should be 
addressed to TP. (thomas.preiss@anu.edu.au). 


Reviewer Information Nature thanks E. Alkalaeva, P. Baranov and Y. Mechulam 
for their contribution to the peer review of this work. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


METHODS 


Data reporting. No statistical methods were used to predetermine sample size. The 
experiments were not randomized. The investigators were not blinded to allocation 
during experiments and outcome assessment. 

Cell lines. The wild-type yeast cell line was BY4741. The SYOI-TAP (YDL063C- 
TAP) mutant cell line was a BY4741 yeast strain containing a C-terminal insertion 
of the modified TAP-tag in YDL063C*!. SYO1-TAP is known to exhibit cytoplas- 
mic LSU shortage and decreased initiation efficiency due to longer LSU waiting 
times, also manifested as polysomal half-mer phenotype in sucrose gradient sed- 
imentation profiles!’, similar to the SYO1-deficient cell lines*”. 

Cell crosslinking and lysis. Wild-type and SYO1-TAP cells were each grown to 
an optical density at 600 nm (ODg¢00 or Asoonm) of 0.7-0.8 in 1 litre of YPAD media 
and snap-chilled by mixing with 25% w/v of crushed ice. 0.1 volumes of freshly 
depolymerized 30% w/v paraformaldehyde was immediately added with intensive 
mixing and the mixture left for 10 min on ice. The fixed cells were pelleted by 
centrifugation at 4°C, 5,000g for 5 min and resuspended in 40 ml of buffer A 
(20mM HEPES-KOH pH 7.4 at 25°C, 100 mM KCl and 2mM MgCl.) supple- 
mented with 0.25 M glycine after 20 min of total fixation time. The cells were 
washed twice by sequential resuspension in 40 ml of buffer A and pelleting as 
described above. The washed cell pellet (1.1 g wet cell mass) was aspirated and 
resuspended in 40% v/w of buffer A supplemented with 0.5mM DTT, 1 U pl! 
RNaseOUT RNase inhibitor (Life Technologies) and 1x Complete EDTA-free Mini 
Protease Inhibitor (Roche). The cell suspension was then flash-frozen by dripping 
into liquid nitrogen. A 10 ml stainless steel grinding jar (Retsch) was pre-cooled in 
liquid nitrogen and filled with ~2 g of the frozen cell suspension droplets and two 
12mm stainless steel grinding balls (Retsch). The sealed grinding jar was shaken 
at 27 Hz for one minute in the Mixer Mill MM 400 (Retsch), re-cooled in liquid 
nitrogen, and shaking continued for another minute. 960 mg of the powdered cell 
lysates was thawed and clarified by two successive centrifugations at 4°C, 13,500g 
for 5 and 10 min, respectively, yielding ~400 1] of clarified lysate of which 38011 
(equivalent to ~650 mg of the original wet cell mass) was loaded onto two sucrose 
gradients in 190,11 aliquots. 

Polysome isolation and digestion. Complexes sedimenting at 80 Svedberg units 
(S) or faster (including monosomes and polysomes, but excluding individual ribo- 
somal subunits) were collected in the pellet fraction after centrifugation through 
a 10% to 20% sucrose gradient. 10% and 20% (w/v) sucrose stocks were prepared 
in buffer B (50 mM Tris-HCl pH 7.0 at 25°C, 50 mM NH,Cl, 4mM MgCh, 1mM 
DTT), and 0.7 ml of the 20% sucrose in buffer B was added to a 34 x 11mm 
(~2.2 ml) thin walled polyallomer tube (Beckman-Coulter). The tube was then 
topped up with 10% sucrose in buffer B, sealed with Parafilm, tilted to 80° from 
the vertical and rotated about its axis for 90s at 20 r.p.m., then slowly returned 
to vertical. The topmost 0.4 ml was removed and the tube rested for 1h at 4°C. 
Sample (up to 200 1) was layered on top and tubes were centrifuged in a TLS-55 
rotor (Beckman-Coulter) at 4°C, 52,000 r.p.m. (179,740g average, k-factor 55.7) 
for 85 min. This centrifugation time and speed were optimized beforehand by 
visualizing the SSU, LSU and ribosome bands using GelStar nucleic acid stain 
(Lonza) in a series of experiments with different centrifugation conditions. The 
pellet was then rinsed in cold buffer A and resuspended by intense pipetting and 
scraping in 200 l of buffer A supplemented with 1mM DTT and 1 x Complete 
EDTA-free Mini Protease Inhibitor (Roche). The resultant material was transferred 
into a 1.5 ml microcentrifuge tube, brought to the bottom by brief centrifugation 
and thoroughly homogenized using a microfuge pestle followed by vortexing at 
4°C for 2 min. Insoluble material was pelleted by centrifugation at 4°C, 13,500g 
for 5 min and the clarified supernatant solution collected. A 1511 aliquot of the 
solution was reserved for the Total Input library construction. The residual solution 
was digested with 15 U of E. coli RNase I (Ambion) per 1 OD2¢o unit of the solution 
at 23°C for 45 min. 

Separation of RNase-liberated polysomal SSU and ribosome complexes by 
sedimentation velocity. 7.5% to 30% w/v sucrose gradients were prepared with 
buffer B by adding 4 ml of 7.5% sucrose to a thin-walled polyallomer 14 x 89mm 
ultracentrifuge tube (Beckman-Coulter) and underlaying it with 3.96 ml of 23% and 
then 3.3 ml of 30% w/v sucrose in buffer B. After topping up with 7.5% w/v sucrose 
in buffer B, the tube contents were mixed by rotation as for the 10-20% gradients 
(see above) and 1.5 ml was removed from the top. The RNase I-digested material 
(see polysome isolation and digestion section) was layered atop (occupying 5-10% 
of the tube length) and centrifuged in an SW 41 Ti rotor (Beckman-Coulter) at 
4°C, 38,000 r.p.m. (178,305g average, k-factor 143.9) for 4h. The resultant gradients 
were fractionated at 1 ml min™! and the SSU (~1.5 ml) and ribosome (~2 ml) 
fractions were identified by in-line monitoring of the absorbance profile at 254nm 
and collected. 

De-blocking and isolation of the crosslinked RNA. RNA was extracted and 
crosslinks reversed by the hot acid phenol procedure as previously described*. 
Typically, a 1511 aliquot from the non-digested total input sample, a 50 il aliquot of 
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the ribosomes, or a 1,500 11 aliquot of the SSU gradient fractions was supplemented 
with 1% w/v sodium dodecyl sulphate, 10 mM EDTA, 10mM Tris-HCl (pH 7.4 at 
25°C), 10mM glycine and deionised water (if necessary) to result in 400 11 aliquots. 
400 ul of 25:24:1 phenol:chloroform:isoamy] alcohol (pH ~4.5, Sigma-Aldrich) 
was then added, and the mixture shaken at 65°C, 1,300 r.p.m. in a thermomixer 
for 45 min to de-crosslink. The mixture was then centrifuged at 15,000g at room 
temperature for 5 min, and the aqueous phase removed and nucleic acids precip- 
itated from it by adding 201g of glycogen, 0.1 volumes of 3 M sodium acetate and 
2.5 volumes of absolute ethanol. After incubation at —20°C for at least 3h, RNA 
was pelleted at 16,000g for 25 min, the RNA pellet washed twice in 80% ethanol 
and dried at 37 °C for at least 30 min in a sterile hood to evaporate residual phenol. 
Dried RNA was thoroughly resuspended by repeated pipetting and vortexing in 
1mM sodium citrate, pH 6.0, and assessed by measuring the absorbance spectrum. 
Construction of the RNA sequencing libraries. One ribosomal and one SSU RNA 
sequencing library was constructed for the wild-type and the SYO1-TAP mutant 
strains, using a low-bias strategy generally as outlined before**’>. RNA concentra- 
tion was estimated using a Bioanalyzer RNA Pico Chip (Agilent). De-crosslinked 
RNA (8 pmol) was heated for 2 min at 70°C, immediately transferred to ice for 
5 min, and then end-repaired with 20 U of 3’-phosphatase positive bacteriophage 
T4 polynucleotide kinase (PNK) (New England Biolabs), using conditions rec- 
ommended by the supplier (1x PNK buffer without ATP, 1.5 U pl~! RNaseOUT 
RNase inhibitor (Life Technologies), incubation for 2h at 37°C). The reaction 
was stopped and PNK inactivated by addition of EDTA to 1 mM and heating of 
the reaction mixture for 10 min at 65°C, respectively. The end-repaired RNA was 
next 3’ polyadenylated with 2 U of E. coli poly(A) polymerase (EPAP) (Ambion) 
generally according to the supplier recommendations (1x EPAP buffer, 2.5mM 
MnCl, 1mM ATP, 1mM DTT, and 0.8 U yl-! RNaseOUT RNase inhibitor (Life 
Technologies), incubation for 1h at 37°C). The resulting polyadenylated RNA 
was ethanol precipitated as before, dried and dissolved in 10 mM HEPES-KOH 
(pH 7.6 at 25°C), 0.1 mM EDTA. The RNA was next reverse transcribed using the 
split adaptor oligo(dT) primer (5'-phosphate-GA TCG TCG GAC TGT AGA ACT 
CTG AAC G/9-carbon spacer/ GTGA CTG GAG TTC CTT GGC ACC CGA GAA 
TIC CAT TTT TTT TTT TTT TTT TIT TVN-3’) with SuperScript III reverse 
transcriptase (Life Technologies), generally as recommended by the supplier. The 
RNA (up to 4pmol) was first mixed with 20 pmol of the split adaptor primer, 
10nmol each of dNTPs and annealed by heating to 75°C for 3 min, then cooling 
to 65°C, and slow ramping to 55°C. The reaction mixture was then supple- 
mented with 5mM DTT, 1 x First Strand Buffer (Life Technologies) and 2 U jl"! 
RNaseOUT RNase inhibitor (Life Technologies) while heated, and then slow- 
ramped to 50°C and incubated for 30 min. To purify the resultant cDNA away 
from the excess of the split adaptor primer, the reaction mixture was heated to 
60°C for 5 min, snap-cooled to 37°C, supplemented with 20 U of E. coli exonu- 
clease I (New England Biolabs) and incubated for 20 min. The reaction was then 
stopped by the addition of 23 mM EDTA. The residual non-extended split adaptor 
primer was removed by purification of the cDNA using Agencourt AMPure XP 
purification system (Beckman Coulter), strictly according to the manufacturer's 
recommendations, except for performing binding steps and removal of the 
supernatant with unbound material at 60°C. Washing of the beads and elution 
of the cDNA with 10 mM HEPES-KOH (pH 7.6 at 25°C), 0.1 mM EDTA were 
performed at room temperature. The resultant purified cDNA was further lib- 
erated from RNA contaminants by heating at 80°C for 10 min, snap-chilling on 
ice, supplementing with 1 jl of RNase A/T mix (ThermoFisher Scientific), and 
incubating for 20 min at 37°C. The purified cDNA was circularized with 50 U 
of CircLigase II (Epicentre/Illumina) generally according to the manufacturer's 
instructions (1x CircLigase II Buffer (Epicentre/Ilumina), 2.5mM MnCh, 
500 mM betaine, incubation at 60°C for 5h). The reaction was stopped by the 
addition of 25 mM EDTA and heating at 80°C for 10 min, and the circularised 
cDNA was purified using Agencourt AMPure XP purification system (Beckman 
Coulter) according to the manufacturer’s recommendations. The cDNA was 
eluted with 10 mM HEPES-KOH (pH 7.6 at 25°C), 0.1mM EDTA and treated 
with Probe-Directed Degradation (PDD), as described***>. The purified cir- 
cularised cDNA was supplemented with 2 pmol of each of the depletion DNA 
probes (Supplementary Table 3)*° and 1x Duplex-Specific Nuclease (DSN) 
buffer (Evrogen), and heated for 3 min at 95°C, then cooled to 75°C, and then 
slow ramped to the hybridization temperature of 48 °C. The mixture, while at the 
hybridization temperature, was then supplemented with 0.5 U of Duplex-Specific 
Nuclease (Evrogen) and further incubated for 20 min. The reaction was stopped by 
addition of 25 mM EDTA, the mixture diluted with sufficient amount of deionized 
water, phenol-chloroform (pH 8.0) extracted and gel-filtered using Micro Bio- 
Spin P-6 (Bio-Rad) columns. The PDD only partly removed rRNA content due 
to the unavoidable rRNA fragmentation and small depletion probe set employed. 
The resultant purified PDD-treated cDNA was amplified with Platinum Pfx 
DNA polymerase (Life Technologies), generally according to the manufacturer's 
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instructions, from primer pairs compatible with TrueSeq Small RNA Sample 
Preparation Kit (Illumina), bearing a unique tag for each library. The forward 
primers were 5‘/-CAA GCA GAA GAC GGC ATA CGA GAT XXX XXX GTG ACT 
GGA GTT CCT TGG CAC CCG AGA ATT CCA-3’ (in which XXX XXX represents 
Illumina’s indexing hexanucleotide sequences), and the reverse primer was 5'-AAT 
GAT ACG GCG ACC ACC GAG ATC TAC ACG TTC AGA GTT CTA CAG TCC 
GA-3’. The amplification reaction included the purified PDD-treated cDNA, 
0.3mM each of dNTPs, 1x Pfx buffer (Life Technologies), 1.25mM MgSOu,, 0.3 .M 
each of primers, 4.2 ng jl! extreme thermostable single-stranded DNA binding 
protein (New England Biolabs) and 0.05 U il! Platinum Pfx DNA Polymerase 
(Life Technologies), and was typically repeated for 12-14 cycles using 94°C melt- 
ing, 62°C (first two cycles) or 76°C (third and higher cycles) annealing, and 68°C 
polymerization temperatures. The amplified DNA samples from different libraries 
were equalized, pooled together, electrophoretically separated in a native agarose 
gel, selected for the size range of the insertion fragment of 10-250 nt, eluted from 
the gel by freezing the gel pieces at —20°C for 2h and recovering the DNA solution 
by immediate centrifugation of the gel pieces through 0.45 1m cellulose acetate 
filter at 13,000g. The recovered DNA solution was then phenol-chloroform (pH 8.0) 
extracted, ethanol precipitated (as described above), dissolved in 10 mM 
HEPES-KOH (pH 7.6 at 25°C), 0.1mM EDTA, quality-controlled with capillary 
electrophoresis (using Agilent High Sensitivity DNA chips), and directed to the 
high-throughput sequencing input™. 

The total input library was prepared as previously described* from the 
de-crosslinked RNA by a similar procedure, except that RiboMinus Eukaryote Kit 
(Life Technologies) was used to deplete the ribosomal RNA at the RNA stage and the 
RNA was fragmented with NEBNext RNA fragmentation module (New England 
Biolabs), instead of E. coli RNase I. Specifically, after crosslink reversal of the total 
input fractions (as described above), ethanol precipitation and re-solubilization 
of the total input RNA, the RNA was treated with RiboMinus Eukaryote rRNA 
removal kit (Life Technologies) strictly according to the manufacturer's instruc- 
tions. The resultant rRNA-depleted total input RNA was fragmented with NEBNext 
Mg?* RNA fragmentation module (New England Biolabs) at 94°C for 6min, and 
next used for library construction in parallel to the ribosome and SSU samples as 
described above. 

Sequencing was performed on a HiSeq 2500 (Illumina) using settings compatible 

with TrueSeq Small RNA Sample Preparation Kit (Illumina) and single-end reads 
of 150 nt. 
Filtering and mapping of the RNA sequencing reads. The 150 nt Illumina reads 
were subjected to a quality trim using Trimmomatic (window: 7 nt; phred quality 
cutoff: 24) followed by adaptor trimming including the (A)2» tract from the reverse 
transcription primer. The reads containing insert sequences without 3’ poly(A) and 
those shorter than 17 nt were discarded. To determine RNA biotype makeup of the 
libraries (Fig. 1c), the resultant insert sequences were then aligned to the S. cerevisiae 
genome (SacCer3) using Bowtie 2 (ref. 36). For all other analyses, a stepwise align- 
ment strategy was employed as follows. Reads were first filtered to keep only those 
not mapping to the rRNA locus (chromosome XII, base pairs from 450,000 to 
491,000). Remaining reads were then mapped to a custom tRNA sequence set and 
alignments were used to analyse tRNA content (see details below). Unmapped 
reads from this alignment were then aligned with spliced RNA sequences contain- 
ing ‘misc_RNA, ‘ncRNA; ‘tRNA or ‘snoRNA primary tags in SacCer3. Finally, the 
remaining unmapped sequences were aligned to an mRNA reference consisting of 
all spliced sequences annotated as protein-coding mRNAs, plus 1,000 bp of flank- 
ing genomic sequence, allowing multiple mappings. Alignment coverage ‘spikes’ 
consisting of identical reads and that exceeded the read density for the surrounding 
transcript region by more than 20-fold were flagged as mapping artefacts and 
excluded. In cases when the 3’ end position of the trimmed read sequences was 
aligned immediately upstream of adenine nucleotides, it was impossible to resolve 
the exact location of the 3’ end of the corresponding original RNA fragment before 
polyadenylation. With these reads, the 3’ end position was randomly assigned with 
equal probability between all possible locations. The metagene analysis was robust 
to subsequent re-randomization: the proportions of read 3’ ends that changed by 
more than 2 nt were 2.26%, 1.51% and 2.20% for input, ribosome and SSU samples 
respectively, causing imperceptible changes at the metagene level. 

mRNAs with 5’ UTR introns without precise splicing coordinates*” were 
removed from further analysis. We analysed skewness caused by library preparation 
by calculating the location-wise Shannon entropy (using the ‘entropy’ package in R) 


after aggregating input fraction fragment start/end position in the 14 most highly 
abundant ORFs (which had sufficiently few zero-read positions), treating each 
nucleotide position as a separate bin regardless of the sequence. All 14 ORFs exhib- 
ited Shannon entropies of >94% of that of a simulated (random, equiprobable) 
set of read positions, indicating low positional bias from the library preparation 
procedure. The RUST package** was also used to analyse skewness in the ribosome 
footprints, using an A-site offset of 17 and footprints of 31 nt in length, and to 
compare with such in known low-bias yeast ribosome profiling data sets*”“°. Due 
to an observed effect of library size on RUST scores, the two published data sets 
analysed were first randomly subsampled to equalize the library size. A trough in 
footprints with 3’ ends located over uridyl nucleotide residues of AUG start codons 
(detected as a diagonal line in, for example, Fig. 2a, Extended Data Fig. 5a, or as a 
vertical line in, for example, Extended Data Fig. 6a, ¢, to the left from the shorter 
of the SSU peaks) was associated with a small bias against 3’-terminal U’s and AU 
dinucleotides, as confirmed by analysing the aggregate densities of ORF-associated 
footprint 3’ ends located over internal AUG triplets and out-of-frame stop codons 
(Extended Data Fig. 4b-d). 

Metagene footprint end-densities were also subjected to gene-level bootstrap- 
ping to estimate the influence of individual genes on the density profile. Genes were 
randomly re-selected, allowing duplications, n times in which n was the original 
number of genes (5,859). Footprint end counts for each nucleotide within the 
regions of interest (—110 to +150 nt relative to start codon, and —170 to +80 nt 
relative the last ORF sense codon) were generated and normalized to total reads in 
the region of interest. This process was repeated 1,000 times and the mean and 95th 
percentiles of the footprint-end density at each nucleotide position were calculated. 

For tRNA content analysis, to overcome the multitude of similar isoforms and 
non-templated addition of ‘CCA to the 3’ ends of tRNAs, a custom reference was 
constructed. Non-redundant tRNA sequences were downloaded from GtRNAdb*"! 
(S288c strain, April 2011) and ‘CCA sequences added to the 3 ends if these were 
not template-specified. Only insert sequences mapping definitively to one tRNA 
were counted and results from the wild-type and SYO1-TAP lines were treated as 
duplicates for the tRNA analysis in edgeR™. 

Code Availability. Custom R scripts were used for this and all other analyses, 
available from the authors on request. 
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Extended Data Figure 1 | Current model of eukaryotic translation with 
emphasis on substeps with scarce, missing or ambiguous evidence. 
a, Schematic representing the current model of the translation cycle starting 
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encroach into the inner 5’ UTR regions (top), as opposed to cap-tethered 
scanning where the SSU 5/-end cap connection is preserved at least until 
start codon recognition. d, Position of the scanning factors relative to 
the SSU: SSU-led scanning in which initiation factors responsible for 

the scanning motion are located and interact with mRNA behind the 
SSU to push it in 5’—3’ direction (top), or an SSU-trailing mechanism 
where these factors interact with mRNA in front of the moving SSU to 
pull it along. e, Alternative recycling of termination complexes: mRNA is 
released from terminating ribosomes before they are disassembled into 
subunits (left), in contrast to a staged recycling where the LSU is first 
ejected from the post-termination ribosomes, leaving post-termination 
SSU attached to mRNA (right). 
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Extended Data Figure 2 | Overview of the in vivo distribution of 
translation complexes on mRNAs measured by TCP-seq. a, Snapshots 
of translation as represented by footprint 5’ ends on mRNA from wild- 
type strain ribosomes and SSU fractions (middle and bottom panels), 
total input mRNA is shown for reference (top panel). Footprint reads 
are mapped to all annotated mRNAs up to 1,000 nt of mRNA 5’ and 3’ 
flanking regions. Footprint 5’ end positions are plotted (horizontal axis) 
against the ORF length rank (vertical axis). Horizontal location (both 
within and outside the ORF) is scaled relative to ORF length, with start 
and stop codon positions indicated at the first nucleotide of these triplets 
(red lines). Footprints with 5’ ends accumulating at a constant distance 
upstream from start and stop appear as curved vertical trails due to the 
scaling. Transparency of points is increased by logio(2 + read number 


FP length (nt) 


per transcript) to allow the distribution in less abundant transcripts to be 
observed. b, c, Same as a, but the horizontal location is scaled to 5’ (b) or 
3’ UTR (c) length, and footprint 5’ ends are plotted against 5’ or 3/UTR 
length, respectively. d, Scheme showing classification of the footprints 
based on position along mRNAs for e, f. e, Profiles of the frequency of the 
footprint lengths in the 5’ UTR, start codon, ORE, stop codon and 3’ UTR 
regions for the ribosome and SSU fractions of the wild-type yeast. Only 
footprints mapping to mRNAs within annotated UTRs are analysed. The 
maximum frequency value for each plot is scaled to 1. f, Proportions of 
the footprints in the different mRNA regions for the ribosome and SSU 
libraries of wild-type and SYO1 yeast strains. For raw read counts, see 
Supplementary Table 1. 
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Extended Data Figure 3 | Characteristics of start-codon-associated, 
stop-codon-associated and ORF-associated ribosome footprints and 
comparison of TCP-seq with ribosome profiling. a, Aggregated 
densities of wild-type yeast footprint 5’ ends (FP-5’s; red) and footprint 

3’ ends (FP-3’s; blue) from the ribosome fraction, mapped to all annotated 
mRNAs and shown within the vicinity of the start (top) or end (bottom) 
of the ORFs (aligned as in Fig. 4a, c). Positions of the major footprint end 
density modalities are indicated (dashed lines). The error regions shown as 
shading of the same colour indicate the 95th percentiles of 1,000 iterations 
of random gene-wise resampling (see Methods for details). b, Schematics 
illustrating the calculated position and 3’ and 5’ footprint extensions for 
the major ribosome footprint types (alignment as for the SSU in Fig. 4b, d) 
across the start (top) or stop (bottom) codons. c, Correlation of the 
ribosome density on mRNAs in exponentially propagating wild-type 

yeast as measured by TCP-seq (vertical axis) or translation (ribosome) 
profiling (horizontal axis). Note that total input denotes either total 
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polysomal RNA (TCP-seq) or total cellular mRNA (ribosome profiling). 

d, End density for ribosome footprint 3’ ends plotted relative to the last 
sense codon from all ORFs, only for footprints with 5’ ends within the 
main peak for terminating ribosome complexes (from — 14 nt to —10 nt). 
Error regions are depicted as in a. e, Same as in Fig. 2a (top panel), but 
showing ORF region from +20 to +120 nt from the start codon. f, Register 
bias (triplet periodicity) for 5’ and 3’ ends of ORF-associated SSU and 
ribosome footprints. Footprints falling between +10 and +200 nt from the 
start codons of all mRNAs are binned into one of three registers according 
to the remainder when dividing the footprint 5’ or 3’ ends position 
(relative to the ORF start) by three. The relative difference between the 
frequency of the most common and least common register is 16.5% for 
ribosome and 9.9% for SSU. The SSU register is shifted 1 nt downstream 

to that of the ribosome, possibly due to a 1 nt recession in the footprint 

5’ ends after LSU dissociation. 
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Extended Data Figure 4 | Assessment of potential library or sequence 
bias. a, Comparison of sequence bias in the TCP-seq ribosome fraction 

to other ribosomal profiling experiments in yeast*?”°, using codon-level 
Kullbach-Liebler divergence as implemented in the RUST package*®. 
Peaks in skewness at the fragment ends likely result from library 
preparation artefacts while the peak at the central A-site (+17 nt from the 
footprint 5’ end) probably results from biological differences in codon 
decoding speed*®. b, Probabilities for occurrence of nucleotides around the 
5! or 3’ ends of TCP-seq reads (footprint 5’ and 3’ ends) generated using 
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Weblogo 3 (ref. 44), c, d, A depletion of footprint 3’ ends at the uridyl 
nucleotides. c, Depletion of footprint 3’ ends over U of the AUG triplets is 
characteristic of all AUG triplets within ORFs, and not only for start AUG 
codons (visible for example, in Extended Data Fig. 6a, c). Internal AUG 
triplets were selected from 50 nt downstream of start codons up to ORF 
ends, and footprint 3’ end densities aligned by these codons. d, Same as 

c, but footprint 3’ ends were aligned around the first nucleotides of 
out-of-frame stop codons located within ORF regions. 
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Extended Data Figure 5 | Assorted analyses of TCP-seq data for the (right panels) to allow for consistency in footprint 5’ and 3’ end extensions 
SYO1 yeast strain with mild deficiency in cytoplasmic LSU levels'?. across translation complexes. For terminating complexes, this corresponds 
a, b, As in Fig. 2a, b. c, d, Top panels as in Extended Data Fig. 3a, bottom to the first nucleotide of the last sense codon (not the stop codon). Error 
panels as in Fig. 4a, c. e, As in Extended Data Fig. 3d, but for the SYO1 regions in c-e indicate the 95th percentiles of 1,000 iterations of gene-wise 
cells. Note that horizontal axes in c, d are aligned by the position of the bootstrapping (see Methods). 
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Extended Data Figure 6 | Snapshots of SSU and ribosome footprint 
distribution reveal translation initiation by a cap-severed, SSU-led 
scanning mechanism and staged ribosome recycling after termination, 
visualized as footprint length versus footprint 3’ end positions 
aggregated across all mRNAs. a, b, As in Fig. 2a, b. c,d, As in Extended 
Data Fig. 5a, b, but for footprint 3’ ends. e, Heatmap of footprint 3’ end 
positions relative to start codons, aggregated amongst mRNAs having an 
annotated 5’ UTR of at least 100 nt in length*’. The extended scanning 
SSU are arrested when the leading edge (footprint 3’ end) overruns the 
start codon by a set margin, irrespective of the trailing edge (footprint 


40 nt < > 

5‘ end) position. This is evidenced by the scarcity of longer footprints 
with 3’ end > +30 nt position and 5’ end < —13 nt position (window 2; 
encompassing 175 footprints) relative to that expected from an 
equivalently bounded region (footprint 3’ end > —30 nt and footprint 

5’ end < —73 nt) that is fully within the 5’ UTR (window 1; encompassing 
1,552 footprints). Isoline iso-3’ and iso-5' margins forming window 1 and 
2 represent positions of the footprint 3’ end and footprint 5’ end isolines 
of SSU located upstream of (window 1) or on (window 2) the start codons. 
f, Schematic of plausible topologies of scanning SSU complexes as they 
approach and align over the start codons, to assist in interpreting data in e. 
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Extended Data Figure 7 | TCP-seq data for individual mRNAs from 
wild-type and SYO1 cells demonstrating uninterrupted 5’ UTR 
scanning and high frequency of initiation. a, Translation initiation on 
the 5‘UTR of ACT1 mRNA coding for actin. Individual SSU footprints 
observed on the ACT1 mRNA of wild-type (upper panel) and SYO1 
(middle panel) cells. Footprint 5’ ends of each footprint are shown in 
black. Annotated mRNA 5’ end* and ORF start are indicated by blue 
and red arrows above the panels and lines of the same colour; potential 


Position of mRNA from first ORF codon (nt) 


start codons and stop codons along the 5’ UTR are indicated below in 

each of three reading frames (lower panel), colour-coded by bars (with 

the legend on the right). Initiation codons located in perfect nucleotide 
context (for yeast) are shown as bars of double width. b, Same as a, but for 
a collection of additional mRNAs. See also the web interface (http://dx.doi. 
org/10.6084/m9.figshare.3206725) for browsing and analysing the 
TCP-seq data for wild-type and SYO1 yeast strains, available at (http:// 
bioapps.erc.monash.edu/TCP/). 
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Extended Data Figure 8 | TCP-seq data for individual mRNAs from 
wild-type and SYO1-TAP cells demonstrating SSU pile-ups within 

5’ UTRs and decreased frequency of initiation. Designations and 
abbreviations as in Extended Data Fig. 7. a, Translation initiation on the 
5’ UTR of GCN4 mRNA coding for general control nonderepressible 
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bZIP transcriptional activator of amino acid biosynthetic genes. b, Same 
as a, but for a collection of other mRNAs. See also the web interface for 
browsing and analysing the TCP-seq data for wild-type and SYO1 yeast 
strains (http://bioapps.erc.monash.edu/TCP/). 
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Extended Data Figure 9 | Efficiency of scanning and start codon 
selection across yeast mRNAs. a, Dependency of the ratio of 5’ UTR to 
start codon-associated SSU footprints on 5’ UTR length. b, Dependency 
of the relative SSU count at start codons on the length of the 5’ UTR. 

c, Dependency of 5’ UTR SSU footprint counts (vertical axis) on the 
abundance and 5’ UTR length of mRNA (horizontal axis; as determined by 
5’ UTR RNA fragment counts in the total input fraction) and translation 
initiation rates (colour; as determined by count of start-codon occupying 


SSU footprints). Data shown in a-c is for the wild-type yeast strain. 

d, e, Cross-correlation of the three major start codon footprint types by 
3’ end position (refer to Fig. 4a, b) observed across mRNAs of the wild- 
type (WT) (d) and SYO1-TAP (SYO1) (e) strains. f, Correlation between 
wild-type and SYO1 yeast cells of the gene-wise footprint count of each 
of the three major start codon footprint types. Pearson's correlation 
coefficients are indicated (r). All counts were incremented by 0.5 to allow 
logarithmic transformation. 
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Extended Data Figure 10 | Summary of TCP-seq findings and their 
interpretation in the context of existing mechanistic models and 
information on ribosome structure. a, Schematic depicting the proposed 
progression of translation initiation, including cap-severed, SSU-led 
scanning, and structural rearrangements at start codons. Post-termination 
events featuring lingering of SSU at stop codons are also shown. b, View 
of the translation cycle with emphasis on the nexus between TCP-seq 
footprint types and recent structural information on the SSU in complex 
with translation factors. The schematic models represent structures 
obtained for in vitro assembled complexes that are taken to represent 
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particular sub-steps of the translation cycle. Note that in vivo translation 
intermediates might not completely match the composition and/or 
structural configuration of in vitro assembled complexes. Schematic 
models representing the anticipated arrangement of scanning and start 
codon pausing SSU'*!°-*, start codon recognition'*!®*, pre-subunit 
joining’? and post-termination recycling”® SSU complexes are shown. 
eIF1 (blue), eIF1A (red), elF2 (dark green), initiator tRNAM* (light 
green), elF3 (purple), e[F4A/B/G/H and ABCE1 (amber), eIF5B (yellow), 
and elongator tRNA.“ (light green). 
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X-chromosome inactivation (XCI) involves major reorganization of 
the X chromosome as it becomes silent and heterochromatic. During 
female mammalian development, XCI is triggered by upregulation 
of the non-coding Xist RNA from one of the two X chromosomes. 
Xist coats the chromosome in cis and induces silencing of almost all 
genes via its A-repeat region”, although some genes (constitutive 
escapees) avoid silencing in most cell types, and others (facultative 
escapees) escape XCI only in specific contexts’. A role for Xist in 
organizing the inactive X (Xi) chromosome has been proposed*®. 
Recent chromosome conformation capture approaches have 
revealed global loss of local structure on the Xi chromosome and 
formation of large mega-domains, separated by a region containing 
the DXZ4 macrosatellite”“!°. However, the molecular architecture 
of the Xi chromosome, in both the silent and expressed regions, 
remains unclear. Here we investigate the structure, chromatin 
accessibility and expression status of the mouse Xi chromosome 
in highly polymorphic clonal neural progenitors (NPCs) and 
embryonic stem cells. We demonstrate a crucial role for Xist and the 
DxXZ4-containing boundary in shaping Xi chromosome structure 
using allele-specific genome-wide chromosome conformation 
capture (Hi-C) analysis, an assay for transposase-accessible 
chromatin with high throughput sequencing (ATAC-seq) and 
RNA sequencing. Deletion of the boundary disrupts mega-domain 
formation, and induction of Xist RNA initiates formation of the 
boundary and the loss of DNA accessibility. We also show that in 
NPCs, the Xi chromosome lacks active/inactive compartments and 
topologically associating domains (TADs), except around genes that 
escape XCI. Escapee gene clusters display TAD-like structures and 
retain DNA accessibility at promoter-proximal and CTCF-binding 
sites. Furthermore, altered patterns of facultative escape genes in 
different neural progenitor clones are associated with the presence 
of different TAD-like structures after XCI. These findings suggest 
a key role for transcription and CTCF in the formation of TADs in 
the context of the Xi chromosome in neural progenitors. 

To investigate the structure of the Xi chromosome, we performed 
allele-specific Hi-C in a clonal neural progenitor cell (NPC) line that 
was derived from highly polymorphic F; mouse embryonic stem (ES) 
cells (129 x Cast, Extended Data Fig. la—f). We first performed Hi-C in 
ES cells, in which XCT has not yet occurred, and found that autosomes 
and both active X chromosomes displayed prominent active/inactive 
(A/B) compartmentalization and TAD structures (Extended Data 
Figs 2a—c and 3). In NPCs, compartments and TADs were similarly 
detected on autosomes and the active X (Xa) chromosome (Fig. 1a, 
Extended Data Fig. 3a). Notably, however, the Xi chromosome dis- 
played no A/B compartments (Extended Data Fig. 2c), but was instead 
partitioned into two massive interaction domains separated by a hinge 
region of ~200kb including the DXZ4 macrosatellite!'!* (Fig. 1a), 


as recently reported for the human and mouse Xi chromosomes’. 
Furthermore, TADs were found to be largely absent on the Xi chro- 
mosome (Fig. la), as previously suggested®””. 
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Figure 1 | The distinct conformation of the Xi and Xa chromosomes. 
a, Allele-specific Hi-C maps of the Xa and Xi chromosomes in NPCs (left), 
and two increasingly smaller regions centred around DXZ4 (centre and 
right). Purple areas in the insulation score plots indicate the interquantile 
range (IQR) of insulation scores over the entire X chromosome, to illustrate 
reduced insulation scores along the Xi chromosome (indicating loss of TADs). 
Black arrow: position of a residual TAD. Red arrow: position of DXZ4. 
b, Top, scheme of DNA FISH probes. Bottom, probes a-b (within the same 
mega-domain) are more overlapping and spherical on the Xi than on the Xa 
chromosome, whereas signals from b-c (across the mega-domain boundary) 
show partitioning on the Xi chromosome into two separate domains. ¢, Loci 
detected by probes a-b are more interacting than b-c on the Xi chromosome, 
in both Hi-C (top) and 3D-DNA FISH (bottom). *P<8 x 107!” (Wilcoxon's 
rank sum test corrected with Bonferroni). NS, not significant. n denotes 
number of cells analysed in DNA FISH. Centre lines: medians, all experiments 
were performed in biological duplicates. 
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Figure 2 | Expression, chromatin accessibility and chromatin 
conformation along the Xi chromosome. a, Hi-C interactions, insulation 
score, ATAC-seq signal and location of expressed genes (via RNA-seq) in 
three regions of the Xa and Xi chromosome. Left to right: the cluster of 
facultative escape genes containing Mecp2, the Xist locus and the region 
encompassing Jarid1c. Hi-C data are shown at 40-kb resolution. b, Regions 
with increased TAD structure harbour promoters that are expressed and 
accessible on the Xi chromosome, as shown by analysis of Hi-C insulation 
(TAD structure), ATAC-seq read counts, and RNA-seq reads per kilobase 
of transcript per million mapped reads (RPKM). Each column represents 
a promoter, sorted by insulation score in the 40-kb interval containing the 
promoter, lowest-to-highest (weakest-to-strongest TAD signal). 


To investigate the spatial organization of mega-domains at the 
single-cell level, we performed DNA fluorescence in situ hybrid- 
ization (FISH) with 18-Mb probe sets located within one mega- 
domain (probes a-b), or spanning the mega-domain boundary 
(probes b-c) (Fig. 1b, Extended Data Fig. 4a). Despite extensive 
cell-to-cell variation, regions within the same mega-domain showed 
greater overlap on the Xi than on the Xa chromosome (Fig. 1b, c, 
Extended Data Fig. 4b), whereas regions spanning the boundary 
showed lower overlap on the Xi chromosome, in agreement with 
Hi-C data (Fig. 1c). Similar results were obtained in an NPC clone 
with a Cast Xi chromosome and in astrocytes (Extended Data Fig. 4c). 
Using two independent quantification methods (Extended Data 
Fig. 4d-f), the volume of each 18-Mb region was found to be modestly 
(approximately 20%) but significantly smaller on the Xi than the 
Xa chromosome, consistent with observations on the human Xi 
chromosome!?-)», Thus, the mouse Xi chromosome is moderately 
compacted and partitioned into two large, spatially distinct domains 
that show varying degrees of overlap within the cell population 
(Fig. 1b, c, Extended Data Fig. 4). 

Although the Xi chromosome globally presented no TADs in 
NPCs, we could detect a few residual TAD-like structures (Fig. 1a, 
black arrow). Integration of Hi-C, RNA sequencing (RNA-seq)!° and 
ATAC-seq data produced in the same lines revealed that these Xi chro- 
mosome TAD-like structures correspond to hotspots of residual tran- 
scription and open chromatin. The RNA-seq and ATAC-seq profiles 
are consistent with global inactivity of the Xi chromosome (Extended 
Data Fig. 5a), with a 75% reduction in both the number of expressed 
genes and accessible elements compared to the Xa chromosome. Most 
ATAC-seq peaks on the Xi chromosome fall in the pseudoautosomal 
region, the Xist locus and at the promoters of genes that escape 
XCI either facultatively or constitutively? (Extended Data Fig. 5a). 
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ATAC counts are extracted from promoters (TSS + 500 base pairs (bp)). 
c, Xi-chromosome-expressed genes (escapees) fall within regions 

with higher insulation scores on the Xi chromosome as compared to 
Xi-chromosome-silenced genes (*P= 4.44 x 107 '°, Fisher exact test 
test), despite having similar insulation scores on the Xa chromosome 
(P=0.43114, Kolmogorov-Smirnov test). d, ATAC-seq peaks on the 
Xi chromosome tend to be closer to TSSs (within 5 kb) than peaks 

on autosomes and the Xa chromosome (statistical significance was 
assessed by Fisher exact test). e, Interaction pile-up maps showing mean 
Hi-C signal for all pairwise combinations of the 87 wild-type NPC Xi 
chromosome escapees on the both the Xa and the Xi chromosome. 


The amount of local structure on the Xi chromosome correlates with 
the density of transcribed loci and accessible elements (Extended 
Data Fig. 5d), as shown in Fig. 2a by three examples: a cluster of 19 
facultative escapees including Mecp2, overlapping a strong ~800-kb 
TAD-like structure; the Xist locus with moderate interactions across 
a ~250-kb region; and 5 escapees including the constitutive Jarid1c 
(also known as Kdm5c) and the facultative Huwe1 genes, embedded 
ina ~500-kb TAD. Importantly, residual TAD-like structures on the 
Xi chromosome occasionally coincide with sub-TAD structures on the 
Xa chromosome (Fig. 2a). Generally, escapees are located in Xi chro- 
mosome regions with higher TAD strength and chromatin accessibility 
as compared to silenced genes (Fig. 2b, c, Extended Data Fig. 5b). 
Thus, in NPCs, the appearance of TAD structures is intimately linked 
to gene expression on the Xi chromosome, unlike on the Xa chromo- 
some and autosomes, where TADs are present even in the absence of 
transcription. 

Notably, 51% of Xi chromosome accessible sites are <5 kb from 
a promoter compared to ~35% on the Xa chromosome (Fig. 2d), 
suggesting that escape is often regulated through promoter-proximal 
sites. Most ATAC-seq peaks on the Xi chromosome were found at 
CTCF-binding sites (Extended Data Fig. 5c), implicating CTCF in 
escape. Recent findings suggest that cohesin (which co-localizes with 
CTCE) is globally lost on the Xi chromosome, which may lead to the 
chromosome-wide loss of TADs®. Our discovery that only escapees 
show TAD-like structures on the Xi chromosome, and that they are 
associated with putative CTCF sites, is consistent with a role for CTCF 
in TAD formation and/or maintenance’”"’. 

Escapees on the Xi chromosome also tend to interact with each 
other even across the mega-domain boundary, consistent with pre- 
vious circularized chromosome conformation capture (4C) analysis 
results? (Fig. 2e and Extended Data Fig. 2d, e). 
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To investigate the importance of the unusual bipartite organiza- 
tion of the Xi chromosome we deleted the ~200 kb boundary region 
encompassing the DXZ4 locus, specifically on the 129 allele in ES cells 
(AFT) (Extended Data Figs 1b and 6a). After differentiation, many 
NPC clones were isolated with a 129 (AFT) Xi chromosome. Deletion 
of the boundary did not affect XCI establishment, as NPC clones with 
either a wild-type or AFT Xi chromosome were obtained. Hi-C per- 
formed on one such clone (D9B2) revealed massive reorganization of 
the AFT Xi chromosome resulting in fusion of the two mega-domains 
(Fig. 3a). No effect was visible on the Cast Xa chromosome (Extended 
Data Fig. 7a). DNA FISH confirmed that sequences on either side of 
the deleted boundary overlap significantly more on the AFT Xi chro- 
mosome compared to wild type, consistent with Hi-C data (Extended 
Data Fig. 8a). 

Notably, in the D9B2 clone, facultative escapees (Mecp2, Huwe1) no 
longer escaped from XCI on the AFT Xi chromosome (Extended Data 
Fig. 6b-c). ATAC-seq and RNA-seq confirmed substantially reduced 
escape on the AFT Xi chromosome, with only 29 expressed genes 
(Extended Data Fig. 8b). Transcription and chromatin accessibility 
were lost at 66 of the 87 facultative escape genes (~76%) (Fig. 3b), 
but maintained at all 6 constitutive escapees (Fig. 3b, Supplementary 
Table 1 and Extended Data Fig. 8c). However, analysis of multiple 
NPC clones, both wild type and AFT, revealed only a slight tendency 
for AFT NPC clones to show less escape than wild type, with var- 
ying degrees of facultative escape between clones, with or without 
the mega-domain boundary (Extended Data Fig. 6d). Nevertheless, 
clone D9B2 provided a unique opportunity to study the relationship 
between transcriptional activity and chromosome conformation on 
the Xi chromosome. 

TAD-like structures were absent on the Xi chromosome when 
expression was lost at facultative escapees in the D9B2 clone (Fig. 3b 
and Extended Data Fig. 7b). Xi chromosome-wide comparisons 
between the D9B2 (AFT) and wild-type NPCs revealed a strong cor- 
relation between loss of escape, loss of chromatin accessibility and 
reduction in TAD signal (Fig. 3c). Specific long-range interactions 
between escapees were also lost on the AFT Xi chromosome (Fig. 3d). 
Eight genes showed de novo escape in the D9B2 clone (Supplementary 
Table 1). These de novo escapees were not clustered or highly acces- 
sible by ATAC-seq, and their expression did not lead to strong TAD 
formation (data not shown). 

Of the 224 Xi chromosome ATAC-seq peaks that we detected in the 
wild-type clone, 139 were lost in the D9B2 clone (Fig. 3e, Extended 
Data Fig. 8b). These lost sites were enriched for promoter-proximal 
location (64%) (Fig. 3f, g). In total, 93% of these promoter-proximal 
sites contain CTCF-binding sites, an enrichment compared to the 
64% of promoter-distal sites (Fig. 3f). These CTCF sites are closer 
to escape gene transcription start sites (TSSs) than unchanging sites, 
again pointing to a role for promoter-proximal CTCF in escape gene 
regulation (Fig. 3h). 

We next investigated the role of Xist in establishing the unusual 
organization of the Xi chromosome. We induced Xist expression in 
undifferentiated male (XY) ES cells carrying a tetracycline-inducible 
promoter at the endogenous Xist locus’. Hi-C revealed that 48h of 
Xist induction resulted in notable structural changes along the X chro- 
mosome, and these changes were not observed after induction of a 
Xist mutant lacking the A-repeat region (Fig. 4a, b)—which cannot 
silence genes (Extended Data Fig. 9a), but is competent for Xist coat- 
ing and exclusion of RNA polymerase II (refs 1, 4). Wild-type Xist 
induction did not lead to detectable changes in TAD structure (data 
not shown), but resulted in increased interaction frequencies along the 
chromosome (Fig. 4b, d). The contact map of the wild-type Xist-coated 
X chromosome was found to be more similar to that of the NPC Xi 
chromosome than either the non-induced or the A-repeat mutant Xist- 
coated X chromosome (Fig. 4a). Notably, physical separation across 
the mega-domain boundary occurred (Fig. 4b), confirmed by RNA/ 
DNA FISH (Fig. 4c, Extended Data Fig. 9b). Induction of wild-type 


LETTER 


DXZ4 DXZ4 


52.88 Mb 


92.96 Mb 82.67 Mb 


a 


63.08 Mb 


NPC AFT 
Cast (Xa) 


0 Ce 1,000 
Normalized reads 


Normalized reads 


Normalized reads 
Normalized reads 
Normalized reads 


74.8Mb 146.38 Mb 


_Mega-domain 
boundary region 


WT NPC 
129 (xi) 
= 
2 fo) 
0 Ca 1 300 
Normalized reads 


ATAC 


Expressed 
genes 


(xi) 


= 
fe} 


NPC AFT 


oD 
N 
it 

iT 


0 a 1,800 
Normalized reads 


41,25 
insulation | L 
1.25 


ATAC al 
ol 


Expressed 23 ] 
genes 


c d 


405 () gained in AFT Xi Xi insulation Lost in AFT Xi (+) 
ell 
Xi_ATAC 


-1 Mb 
35 | NPC 
a 1M 


Xi RPKM a 
10 NPC 5 
3 i NPC AFT 
+1 Mb 


| Rp/10;Snora70 IJaridic 


-1Mb_ +1Mb 
_a 


NPC AFT 
Cast (Xa) 


5 a 40 
Normalized reads 


NPC AFT 
129 (Xi 


X-chromosome genes 


e f g h ; 
EM Within 5 kb of TSS 5 CTCF motifs 
ATAC-seq peaks ATAC-seq peaks OD >5 kb from TSS = — Xi lost in AFT 
on WT Xi lost on AFT Xi => 5 0.8{ — Xi no change in AFT 
— Xa specific 
H3K27ac Other within 5 kb 
p300 of TSS 


No >5 kb 
change 


m from TSS 
Lost in 


AFT CTCF 
Other’ 
H3K27ac 


70 

60 s 
50 

40 

30 

20 

10 

0 


CTCF No Lost 
change in AFT 


ATAC-seq peaks (% 


Distance to TSS 
(log, base pairs) 


Figure 3 | Deletion of the mega-domain boundary leads to loss of 
bipartite folding. a, Hi-C contact maps for Xa (Cast) and Xi (129 AFT) 
chromosomes in mega-domain boundary mutant NPCs (left), and 

for increasingly smaller regions (centre and right). b, Two regions 

on the AFT Xi chromosome showing Hi-C, RNA-seq and ATAC-seq 
signal. The same regions as in Fig. 2a are shown. ATAC-seq and RNA- 
seq from the Xi chromosome in wild-type NPCs are included for 
reference (shown in Fig. 2a). c, Loss of TAD structure in AFT NPCs 
correlates with loss of accessibility (ATAC) and expression (RPKM). 
Each column represents a promoter. Heat maps are sorted by change 

in TAD strength (insulation score) from wild-type to AFT Xi in the 
40-kb interval containing the promoter. ATAC counts are extracted 
from promoters (TSS + 500 bp TSS). d, Interaction pile-up map showing 
mean interaction signal in AFT NPCs for all pairwise combinations of 
the 87 wild-type NPC Xi chromosome escapees. e, Quantification of 
ATAC-seq peaks in wild-type and AFT NPCs on the Xi chromosome. 
Of 224 Xi chromosome peaks in the wild type, 139 are lost in the mutant. 
f, Chromatin immunoprecipitation followed by sequencing (ChIP-seq) 
annotation of ATAC-seq peaks lost in AFT NPCs. g, Distance from TSS 
of ATAC-seq peaks that are lost, or do not change, after deletion of the 
mega-domain boundary. h, Distance from escapee TSS of CTCF peaks 
that are lost in AFT NPCs, do not change in AFT NPCs, and peaks on the 
Xa chromosome. 
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Figure 4 | Xist-mediated silencing is sufficient to generate a boundary 
at DXZ4 in ES cells. a, Top, Hi-C analysis on the X chromosome in 

male ES cells expressing wild-type (WT) or A-repeat mutant (AA) Xist, 
before (—dox) and after (+-dox) 48 h induction. dox, doxycycline. Bottom, 
correlation analysis of male X chromosome and NPC Xi chromosome 
Hi-C maps, showing increased similarity between the male Xist-coated 

X and Xi chromosomes after induction of wild-type, but not AA Xist. 

b, Alignment of structural changes detected in Hi-C and changes in 
chromatin accessibility measured by ATAC-seq after 48 h of wild- 

type or AA Xist expression. Dashed line: DXZ4 position. Arrowheads: 
increased interactions on either side of the mega-domain boundary. 
Asterisk: genomic position of Firre. c, RNA/DNA FISH showing increased 
overlap of probes a-b on the Xist-coated X chromosome after 48 h of 


Xist from one X chromosome in female ES cells”° generated a bound- 
ary of comparable magnitude (Extended Data Fig. 9c). ATAC-seq 
showed that 48 h of wild-type (but not A-repeat mutant) Xist expression 
results in globally reduced (but not eliminated) accessibility on the X 
chromosome (Fig. 4b, e). Interestingly, a small number of loci showed 
increased chromatin accessibility after wild-type Xist coating includ- 
ing the Firre long noncoding RNA (Fig. 4b), which together with the 
DXZ4 macrosatellite has been proposed to anchor the Xi chromosome 
to the nucleolus?!. Regions of reduced chromatin accessibility tend to 
show increased interaction frequencies with loci in the surrounding 
~20 Mb of genomic sequence (Extended Data Fig. 9d). We note that 
conformational changes on the Xist-coated X chromosome in male ES 
cells were relatively mild compared to the NPC Xi chromosome, prob- 
ably for several reasons. First, Xist was induced in ~35-45% of cells (in 
two independent replicates). Detecting structural alterations in such 
a sub-population may be difficult by Hi-C, and this may explain our 
observation that TADs are still detected in the overall cell population. 
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wild-type, but not AA Xist induction. Probes b-c show lower overlap 
(indistinguishable from non-Xist coated chromosomes in cells where 
Xist expression was not induced after doxycycline treatment), in line with 
Hi-C predictions (see Methods). Xist— signals correspond to the —dox 
sample. *P < 0.05 (Wilcoxon’s rank sum test corrected with Bonferroni). 
n denotes number of cells analysed in DNA FISH. Centre lines: medians. 
Boxes: middle 50% of data points. Two biological replicates were analysed. 
d, Changes in Hi-C contact probability after 48h of wild-type or AA 

Xist expression, indicating a wild-type Xist-specific increase in contact 
probability on the X chromosome. e, Changes in the ATAC-seq signal at 
X chromosome and autosomal peaks, indicating wild-type Xist-specific 
loss of chromatin accessibility on the X chromosome. 


Second, the Xist-coated Xi chromosome chromatin state in ES cells may 
not be comparable to that in differentiated NPCs”. 

In conclusion, our study uncovers a complex molecular architec- 
ture of the Xi chromosome, and reveals new insights into TAD for- 
mation. The Xi chromosome is moderately compacted and organized 
into two mega-domains, with global loss of TADs, except at clusters 
of expressed genes. Thus, in contrast with the notion that TADs are 
highly stable across differentiation and do not require transcription 
for their presence or maintenance!?3, our findings demonstrate that 
(1) TADs can be lost in the context of the Xi chromosome, at least in 
NPCs; and (2) transcription together with binding of factors such as 
CTCE may enable their maintenance or de novo re-creation. 

The bipartite folding of the Xi chromosome into two mega- 
domains is evolutionarily conserved, pointing to a possible role for 
this peculiar organization in the XCI process. Deletion of the bound- 
ary region, leading to fusion of the two mega-domains, does not affect 
initiation of XCI. However, somewhat reduced rates of facultative, 
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but not constitutive, escape are observed. Facultative escapees are 
silenced during XCI and then re-expressed**”> and may be more 
prone to stochastic events and/or the influence of long-range inter- 
actions with other parts of the X chromosome, such as the CTCF-rich 
DXZ4 region*®. As promoter-proximal CTCF sites characterize both 
facultative and constitutive escape genes, we speculate that transient 
interactions of escapees with the boundary region during XCI may 
influence escape (Extended Data Fig. 9e). However, the highly varia- 
ble facultative escape that we found between NPC clones, both with 
or without the DXZ4 boundary, suggests that escape efficiency cannot 
be due just to the presence of the DXZ4 region, but may be influenced 
by local folding and accessibility to CTCF binding. The exact timing 
and mechanism of facultative escape, and its relationship with TAD 
formation, will require further investigation. In conclusion, our study 
establishes the Xi chromosome as a powerful system for studying the 
mechanistic relationships between chromosome conformation and 
gene regulation, and points to key roles for gene activity and CTCF 
in the establishment of TAD structure in the context of facultative 
heterochromatin. 

Note added in proof: Consistent with our observations in mouse, 
deletion of DXZ4 from the human Xi chromosome results in loss of 
mega-domain structure (E. Lieberman Aiden & B. Chadwick, Deletion 
of the macrosatellite DXZ4 on the human inactive X chromosome alters 
higher-order genome architecture. Proc. Natl Acad. Sci USA (in press)). 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized, and investigators were not blinded to allocation during 
experiments and outcome assessment. 

Cell culture. The hybrid mouse ES cell line F121.6 (129Sv-Cast/EiJ), a gift from 
J. Gribnau, was grown on mitomycin-C-inactivated mouse embryonic fibroblasts 
in ES cell media containing 15% FBS (Gibco), 10-4M B-mercaptoethanol (Sigma), 
1,000 U ml"! leukaemia inhibitory factor (LIF, Chemicon). Male-inducible TXY 
and TXY:AA lines (a gift from A. Wutz, called Xist-tetOP and Xist-ASX-tetOP, 
respectively, in ref. 1) were cultured in the same conditions and treated for 48h with 
2g ml"! doxycycline. Differentiation of F121.6 ES cells into NPCs was performed 
as previously described'®. Subcloning of NPCs was made by limiting dilution and 
manual colony picking. All cells used in this study were characterized for absence 
of mycoplasma contamination. 

Boundary deletion. To generate the boundary region deletion, 5 x 10° ES cells 
were transfected with 51g each of two plasmids (pX459) each expressing Cas9 
and a chimaeric guide RNA (gRNA1: CATGTTTGAGCATGGAAACCCGG, 
chrX:72823838-72823860; gRNA2: GGGTTATGGCGGTCGGTTCCTGG, 
chrX:73025513-73025535). Subcloning of ES cells was made by limiting dilu- 
tion. Cells were treated for 24h with puromycin. As soon as visible, single col- 
onies were picked under a microscope to be screened for deletion by PCR 
(forward primer: 5’-CGTAGACGCGGCAGTAGTTT-3’, reverse primer: 
5/-ACATAAACTCCTTTTCAGGACCA-3’). To identify the targeted allele, we 
performed a PCR using primers (forward: 5’-CTGTCCAAATGGAGGTGCTT-3’, 
reverse: 5‘-CCTAGGTCCGCTCTCTATCG-3’) that amplify a 203-bp amplicon 
specifically on the wild-type allele, which contains a single nucleotide polymor- 
phism (SNP; rs29035891). After amplification, PCR products were gel-purified 
and sequenced using the forward or reverse primer used for PCR. Clones positive 
carrying the deletion were expanded and differentiated into NPC as previously 
described’® and subcloned by limiting dilution. NPC lines were maintained in 
N2B27 medium supplemented with EGF and FGF (10ngml each), on 0.1% 
gelatin-coated flasks. Clones carrying the boundary deletion on the inactive X were 
identified by RNA FISH against Xist with the p510 plasmid probe and DNA FISH 
with a BAC hybridizing inside the deleted region (RP23-299L1). 

RT-PCR and pyrosequencing. RNA extraction and cDNA synthesis were per- 
formed with the Cells-to-Ct kit (Ambion) following the manufacturer's instructions. 
Pyrosequencing primers were designed using the PyroMark Assay Design software. 
PCR products were purified and annealed with sequencing primers for pyrose- 
quencing using the PyroMark q24 (Qiagen) (Xist: forward primer 5‘-AGAGAGCC 
CAAAGGGACAAA-3’, reverse primer 5‘-TGTATAGGCTGCTGGCAGTCC-3’, 
sequencing primer 5‘-GCTGGCAGTCCTTGA-3’; Mecp2: forward primer 5’- 
CTGAAGGTTGTAGTGGCTCATG-3’, reverse primer 5’-ATGGTAGCTGG 
GATGTTAGGG-3’, sequencing primer 5’-CAGAGACAAGCCACTGA-3’; 
Huwel1:forward primer 5‘-GCAGGTGTCTGCAAATCCA-3’, reverse primer 
5'-GCCGATGTAAAGGCTCCAAG-3’, sequencing primer 5’-TGGGTTCATGT 
GACAG-3’; Jarid1c: forward primer 5’-GCTGCCTCCTTTGCCTGAAAT-3’, 
reverse primer 5‘-TTCAGGGGGCCGCTTACA-3’, sequencing primer 
5’-CTCCTTTGCCTGAAAT-3’). 

Hi-C read mapping, binning, ICE correction. Hi-C was performed as previ- 
ously described?”*, To obtain allele-specific Hi-C interaction maps in female ES 
cells (Xa°*'Xa!?°) and a derived clonal NPC line (Xa“'Xi!”?) (Methods; Extended 
Data Fig. 1)!°, we first constructed an allelic genome using the reference mm9 
genome and all 19,722,473 SNPs. The allelic (Cast and 129) genomes were then 
combined to create a reference diploid genome (consisting of 44 chromosomes; 
1-19 X,Y,M). All reads were aligned to the diploid genome (as described in 
ref. 29), thus allowing for a competitive mapping strategy between the two 
alleles. All reads were trimmed to 50 bp and then aligned using the novoCraft 
novoalign (v.3.02.00) software package. Reads were aligned using the following 
options (-r all 5 -R 30 -q 2 -n 50, minimumReadDistance = 5). The best align- 
ment was selected from the list of the top 5 alignments. The alignment was 
considered unique (allelic), if its alignment score was >5 from the second best 
alignment score (alignment score taken from the ZQ tag). Reads that aligned 
uniquely to an allele were classified as allelic (either Cast or 129) whereas reads 
that aligned to both alleles equally (<5 distance) were classified as ambiguous 
(AMB) (Extended Data Fig. 1d). Uniquely aligned Hi-C interactions between loci 
located on the same chromosome were assigned to a specific parental chromo- 
some in cis when at least one of the two reads contained a diagnostic SNP, and the 
other either contained a SNP from the same allele, or mapped to both alleles*’. 
We obtained the following paired-end read counts: For ES cells (GUR.2d), a 
total of 401,684,614 interactions could be aligned combining the two replicates, 
372,272,389 of which were unique (after PCR duplicate filter), and 95,650,438 
of which could be placed to either the Cast or 129 allele (25.69%). For NPCs 


(GEI.72b), a total of 277,440,656 interactions could be aligned, 253,254,798 of 
which were unique (after PCR duplicate filter), and 82,323,031 of which could be 
placed to either the Cast or 129 allele (32.51%). For AFT NPCs (D9B2/B129T3), 
a total of 229,331,123 interactions could be aligned, 222,941,525 of which were 
unique (after PCR duplicate filter), and 85,331,870 of which could be placed 
to either the Cast or 129 allele (38.28%). The difference in percentage of reads 
assignable to either allele is probably due to differences in the percentage of 
cis interactions found in each sample (biological or technical variation). The 
82-95 million read depth supported generation of allele-specific chromatin inter- 
action maps at multiple resolutions (10 Mb, 2.5 Mb, 1 Mb, 500kb, 250 kb, 100 kb 
and 40 kb). 

Biological replicates were highly correlated. Pearson's correlation coefficients 
for 500 kb data on chrX were as follows: EHSNP-mF1216__R1R2__chrX-129S1, 
0.992331; EHSNP-mF1216__R1R2__chrX-cast, 0.990373; EHSNP-mNPe- 
deltaRF__R1R2__chrX-129S1, 0.976562; EHSNP-mNPe-deltaRF__RIR2__ 
chrX-cast, 0.983614; EHSNP-mNPe__R1R2__chrX-129S1, 0.990976; 
EHSNP-mNPe__R1R2__chrX-cast, 0.995202. Autosomes showed similar corre- 
lation values. Overall these numbers indicate that the produced Hi-C data was 
of high quality and well correlated between biological replicates. We pooled all 
biological replicates into a single Hi-C data set per sample and subsequently used 
the pooled data for all analyses. 

Iterative mapping and error filtering/iterative correction of the chromatin 
interaction data were performed as previously described”’". Iterative correction 
was performed on the diploid (44 chromosomes) (replicate pooled) genome-wide 
matrix for all resolutions. 

Hi-C for the TXY (male) samples was performed as previously describe 
Reads originating from the TXY (male) sample was aligned to the mm9 reference 
genome. Iterative mapping and error filtering/iterative correction of the chromatin 
interaction data were performed as previously described”®*". Iterative correction 
was performed genome-wide (22 chromosomes) (replicate pooled) on the genome- 
wide matrix for all resolutions. Biological replicates were highly correlated. We 
pooled all biological replicates into a single Hi-C data set per sample and subse- 
quently used the pooled data for all analyses. 

We obtained the following paired-end read counts: For TXY +dox, a total of 
277,191,448 interactions could be aligned, 267,007,192 of which were unique (after 
PCR duplicate filter). For TXY WT —dox, a total of 308,671,996 interactions could 
be aligned, 300,102,244 of which were unique (after PCR duplicate filter). For 
TXY:AA +dox, a total of 281,116,218 interactions could be aligned, 273,612,976 
of which were unique (after PCR duplicate filter). For TXY:AA —dox, a total of 
298,436,664 interactions could be aligned, 289,376,893 of which were unique (after 
PCR duplicate filter). 

Allele-specific read mapping validation (Hi-C and ATAC-seq). To validate the 
accuracy of the allele-specific read alignment strategies used in this paper, we 
first constructed a set of validation reads tiled across all SNP locations between 
the Cast and 129 genomes. In brief, for each SNP location on the X chromosome, 
all overlapping 50-bp reads were extracted (50 total) for each of the 129 and Cast 
alleles (Extended Data Fig. 1i). All reads were then processed through the Hi-C 
and ATAC-seq mapping pipelines described in the methods to measure assign- 
ment accuracy. Encoded into each fastq readID, was the allelic genome that each 
read originated from, the relative-position (within the 50-bp read) of the SNP, the 
base-call of the SNP (A, C, T, G), the chromosome and position (start, end) of 
the read. The set of validation reads were then processed through the ATAC-seq 
allele-specific and Hi-C allele-specific pipelines with no modifications. Each pro- 
cessed read was then scored according to whether or not it was correctly placed 
to not only the correct chromosomal location, but to the correct allele as well. 
For the Hi-C allele-specific pipeline we found 0 reads assigned to the incorrect 
chromosomal location and 0 reads assigned to the incorrect allele. All processed 
validation reads were assigned to the correct coordinate and allele. Reads that 
span repetitive regions or are of low complexity were inherently filtered via the 
Hi-C pipeline and thus would be excluded from both the actual data and from the 
set of validation reads. For the ATAC-seq allele-specific pipeline, we found that 
for reads coming from the 129 X chromosome, only 0.09% mapped to the Cast 
chromosome (only 3 of these fell within ATAC-seq peaks). For reads coming 
from the Cast X chromosome, only 0.21% mapped to the 129 chromosome (5 of 
which fell within ATAC-seq peaks). Furthermore, we feel that our analysis may be 
even more accurate when using paired-end reads as we do for all ATAC-seq data 
analysis (Extended Data Fig. 1i). 

Generation of Xist-positive Hi-C signal for comparison with DNA FISH. Xist 
RNA FISH performed in parallel with Hi-C on the same inducible Xist ES cell 
samples revealed that Xist expression (either wild-type or A-repeat mutant) was 
induced in ~35-45% of dox-treated cells. Hence the Hi-C signal can be repre- 
sented as the sum of ~35-45% of reads coming from Xist-positive X chromosomes, 


27:28 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


and ~55-65% of reads that are generated from non-coated X chromosomes. For 
example, 


Hi-C(+dox) = 35% x Hi-C(Xist+) + 65% x Hi-C(Xist—) 


We took advantage of the fact that the signal from Xist— cells is measured inde- 
pendently in the non-induced (—dox) sample: 


Hi-C(Xist—) = Hi-C(—dox) 
to extract the signal of Xist-positive cells: 
Hi-C(+dox) = 35% x Hi-C(Xist+) + 65% x Hi-C(—dox) 
and hence 
Hi-C(Xist+-) = [Hi-C(+dox) — 65% x Hi-C(—dox)] x (1/0.35). 


Negative values were assigned to ‘not analysed’ (NAs). The Hi-C(Xist+) signal 
was then used to compare Hi-C data with RNA/DNA FISH experiments in Fig. 4b, 
which allow to discriminate Xist-coated and non-coated chromosomes visually 
by the presence of an Xist cloud. In Fig. 4c, Xist— signals correspond to the —dox 
sample. 

Hi-C SNP density filter. To remove potential biases in the Hi-C data related to 
the density of SNPs in each bin, we calculated the number of SNPs residing in each 
genomic interval (bin) for all Hi-C bins across all bin sizes. We then calculated the 
median number of SNPs per bin, and produced a minimum required SNP density 
cutoff defined as the (median — 1.5 x IQR). Any bins with less SNPs than the cutoff 
were removed from all analyses. The SNP density cutoffs used for each bin size 
were: 40 kb, 43 SNPs; 100 kb, 216 SNPs; 250 kb, 776.5 SNPs; 500 kb, 1,767.25 SNPs. 
The non-SNP-density-filtered data was only used for visualization purposes (figure 
heat maps). Throughout the manuscript, we refer to Hi-C as data that has been 
iteratively corrected*! and run through the SNP-density filter. 

Compartment analysis. The presence and location of the A/B compartments were 
calculated as previously described**. Compartments were derived from the 250-kb 
iteratively corrected Hi-C data for each chromosome separately using the CIS maps 
for each sample/allele. The code used to generate the compartments (PC1 from 
PCA analysis) is publicly available on Github (matrix2compartment.pl): https:// 
github.com/dekkerlab/giorgetti-nature-2016. Compartments were generated all 
default options except the (cis alpha) option, set to (-ca 0.005). 

Insulation and boundary calculation. TAD structure (insulation/boundaries) 
was defined via the insulation method as previously described with minor 
modifications*”. The code used to calculate the insulation score is publicly avail- 
able on Github (matrix2insulation.pl): https://github.com/dekkerlab/giorgetti- 
nature-2016. Insulation vectors were detected using the following options: 
(-is 480000 -ids 320000 -im iqrMean -nt 0 -ss 160000 -yb 1.5 -nt 0 -bmoe 0). The 
output of the insulation script is a vector of insulation scores, and a list of minima 
along the insulation vector (inferred as TAD boundaries). The TAD boundaries 
were not used in this study. 

Interaction pile-up maps. Interaction pile-up maps were constructed from all 
pairwise interactions between either the list of 87 wild-type NPC Xi chromosome 
escapees or the 29 AFT NPC Xi chromosome escapees. Using the 40 kb Hi-C data, 
a 2-Mb window centred around each pairwise interaction (pixel) was taken (25 
bins in each direction, yielding 51 x 51 sub-matrix). Any resulting sub-megabase 
that overlapped the (y= x) diagonal in the matrix was excluded from the analysis 
(effectively excluding all interactions <2 Mb). All sub-matrices were then averaged 
to produce the final (mean) pile-up map. A strong signal at the centre suggests that 
the elements used tend to contact one another in 3D space. 

The Xi chromosome is as accessible and detectable in Hi-C as the Xa chromo- 
some and autosomes. The number of RAW reads observed for both the Xa and 
Xi chromosomes were very similar for all chromosomes, thus demonstrating that 
the Xi chromosome is not simply less accessible/visible to the Hi-C methodol- 
ogy. ES-cell-chrX-129S1, 1,118,327; ES-cell-chrX-Cast, 1,104,709; NPC-chrX- 
12981, 1,147,072; NPC-chrX-Cast, 1,148,128; AFTNPC-chrX-129S81, 1,314,476; 
AFTNPC-chrX-Cast, 1,288,802. Bias in read directional due to partial digestion is 
typically observed up to ~10kb. For interactions between fragments separated by 
over 10kb this bias is negligible, indicating at least one digestion occurring between 
them in every cell. This genomic distance is therefore a measure for digestion effi- 
ciency”. For both the Xa and Xi chromosomes, this genomic distance is ~6-10 kb, 
indicating that digestion efficiency of chromatin on the Xa and Xi chromosomes 
are comparable. Thus, the unique conformation of the Xi chromosome does not 
affect Hi-C analysis, as was also found for condensed mitotic chromosomes?’. 
Correlation analysis of Hi-C matrices. We compared X chromosome interaction 
matrices of Xist-inducible lines (pre/post-Xist induction in wild-type and A-repeat 
mutant samples) with that of the NPC Xi chromosome, at 500-kb resolution. As 
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distance-dependent decay of interaction frequency causes all interaction matrices 
to be highly correlated, we first compensated for this effect by multiplying the 
read count in each bin by its respective genomic distance. We then calculated the 
Spearman correlation between each pair of matrices. After wild-type Xist induc- 
tion in TXY cells, the interaction map becomes more similar to that of the NPC 
Xi chromosome (rho = 0.17 to 0.31), while no change is observed in the repeat-A 
mutant (rho=0.17 to 0.17). 
RNA and 3D-DNA FISH. FISH was performed as previously described**. ES cells 
and NPCs were cultured on gelatin-coated coverslips #1.5 (1mm) and fixed in 3% 
paraformaldehyde for 10 min at room temperature. Cells were permeabilized on 
ice for 5 min in 1x PBS, 0.5% Triton X-100 and 2mM vanadyl-ribonucleoside 
complex (VRC, New England Biolabs), and coverslips were stored in 70% ethanol 
at —20°C. Before FISH, samples were dehydrated through an ethanol series (80%, 
95%, 100% twice) and air-dried briefly. For RNA FISH, cells were directly hybrid- 
ized with denatured probes. For DNA FISH, samples were first denatured in 50% 
formamide/2 x SSC (pH =7.3) at 80°C for 37 (ES cell) and 35 (NPC) min, immedi- 
ately placed on ice and washed twice with ice-cold 2x SSC. After overnight hybrid- 
ization at 37°C for RNA FISH or 42°C for DNA FISH, coverslips were washed at 
42°C for RNA or 45°C for DNA, three times for 5 min in 50% formamide/2 x SSC 
at pH 7.3, and three times for 5 min in 2x SSC. Nuclei were counterstained with 
0.2mgml-! DAPI (2 mgm’ for structured illumination microscopy), further 
washed twice for 5 min in 2 SSC at room temperature and finally mounted with 
90% glycerol, 0.1 PBS, 0.1% p-phenylenediamine at pH9 (Sigma). 
RNA FISH probes. We used the p510 plasmid coupled with Cy5 to detect Xist. 
For RNA FISH on escape genes, we used the following BAC and fosmid probes: 
RP23-436K3, RP23-328M22, RP24-436K3, WI1-12690 10 (Mecp2), RP24-157H12 
(Huwe1), RP23-13D21 (Gépdx), RP24-148H21 (Jarid1c). 
DNA FISH probes. In experiments to detect the mega-domain boundary, 
fluorescent oligonucleotides (average length 45 bp, 5’-modified with Atto 448 or Atto 
550, average density: one oligonucleotide every 3 kb) were obtained from MYcroarray 
Inc. Oligonucleotides were designed to tile the following consecutive 18-Mb regions: 
chrX:35,000,000-53,000,000, chrX:53,000,000- 72,000,000, and chrX:72,000,000- 
90,000,000. To detect the DXZ4 region we used the RP23-299L1 BAC. 
Imaging and quantification of 3D DNA FISH. Three-dimensional image stacks 
(200 nm distance between consecutive xy planes) were acquired on a DeltaVision 
Core wide-field microscope (Applied Precision) equipped with a CoolSNAP HQ2 
camera operated at 2X binning, and a 100 PlanApo oil immersion objective 
(the effective pixel size was 129 x 129nm). Xi chromosome signals were iden- 
tified via the presence of an Xist mRNA cloud in the far-red channel (p510-Cy5 
probe). Pearson correlation between red and green signals was calculated using 
custom-made ImageJ macros as follows. After subtracting the background from 
each xy plane (generated by morpholoigcal opening the image with a circle of 
5 pixels in radius), Pearson correlation between red and green pixel intensities was 
measured inside a fixed-size region of 40 x 40 x 20 pixels (5.16 x 5.16 x 44um?) 
centred on each FISH signal. The significance of Xi versus Xa chromosome dif- 
ferences in correlation was assessed by Wilcoxon's rank sum test. Random nuclear 
positions were used to estimate the background correlation that could be observed 
due to non-specific probe hybridization. 

The gyration tensor of a greyscale image is defined as 


Sab= Mere — 19) (rh = 16") YT 
k k 


where k is an index running over voxels, I; is the greyscale intensity of voxel k, and 
rk andr are the a-th components (x, y, or z) of the xyz position of voxel k, and 
of the centre of mass of the image, respectively. The gyration tensor was valuated 
in a region of interest of 3.8 x 3.8 x 4 jum? centred on each FISH signal and the 
gyration radius was calculated as 


Rg = JAit+ A2+ A3 


where j,2,3 are the eignevalues of Sap. 

RNA-seq. RNA-seq data for the ES cell (GUR.2d) and NPC (GEI.72b) was obtained 
from previously published work (PMID 24576422)!°. RNA-seq data for the mutant 
NPC (D9B2/B129T3) was obtained and processed as previously described'®. 
RNA-seq ‘expressed/escapee’ classification. The allelic RPKM values 
were derived for each gene by splitting the RPKM value by the 129 ratio. 129 
RPKM = (RPKM x 129 ratio); Cast RPKM = (RPKM x (1 — 129 ratio)). Any gene 
with an allelic RPKM value >3 RPMK was classified as being expressed. Any gene 
expressed on the Xi chromosome was classified as being an escapee. 

ATAC-seq. ATAC-seq library preparation was performed exactly as previously 
described**. Sequencing was carried out on an Illumina NextSeq 500 generating 
2x75 bp paired-end reads. Libraries were sequenced to a depth of 25-35 million 
reads per sample. Reads were trimmed using CutAdapt and aligned using Bowtie2. 
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Reads were aligned to a custom 129/CastEiJ genome in which SNP sites were 
replaced by ‘N. Approximately 52-58% of reads per line contained ‘N’s and were 
assigned to the 129 or Cast allele based on the identity of the base at that location. 
Reads containing non-concordant SNPs were rare and were discarded. Reads not 
containing SNP sites were included in overall peaks but not were excluded from 
allele-specific tracks. ATAC-seq peaks were called using MACS2 with no shifting 
model. For TXY Xist-inducible male cell lines, which contain only one X chro- 
mosome, peaks were called after normalizing all samples by read count on the 
autosomes. The set of X chromosome peaks was defined as the merge of peaks from 
all samples, and bedtools was used to calculate coverage within peaks. 
Assigning allele-specific ATAC-seq peaks. For each ATAC-seq peak, all 
N-containing reads were counted and assigned to 129 or Cast alleles based on 
SNP at the N-containing position. To assign monoallelic and biallelically accessible 
peaks, a d-score was calculated as a measure of allelic imbalance”. In brief, for a 
given peak the d-score was calculated as the ratio of 129 reads to total number of 
reads — 1/2. A peak with a d-score > 0.3 was assigned as a 129-specific peak. A 
peak with a d-score < —0.3 was assigned as a Cast-specific peak. Any peak with a 
d-score > —0.3 was assigned as a peak in 129 (monoallelic or biallelic). Any peak 
with a d-score < +0.3 was assigned as a peak in Cast (monoallelic or biallelic). 
Annotating ATAC-seq peaks using ChIP-seq data. ATAC-seq peaks were anno- 
tated using existing published ChIP-seq data sets. CTCF ChIP-seq came from 
whole female mouse brain**. Called CTCF binding sites were used and extended 
+300 bp before overlapping with ATAC-seq peaks. H3K27ac and p300 ChIP-seq 
are from mouse NPCs*”. For H3K27ac and p300 ChIP-seq data, peaks were called 
using MACS2 and then overlapped with ATAC-seq peak locations. 

Integrating Hi-C, ATAC-seq and RNA-seq data. Integrative analysis of Hi-C 
insulation (TAD structure), ATAC-seq counts, and RNA-seq RPKM was per- 
formed as follows. A promoter region was defined for each gene as +500 bp from 
the TSS. ATAC peaks were assigned to a gene if they overlapped with the promoter 
region. In the event that >1 ATAC peak overlapped with the promoter, the closer 


ATAC peak was chosen. An ATAC count of 0 was assigned to each promoter, if it 
did not contain an ATAC peak. If the ATAC allelic counts overlapping the promoter 
were <10, then the ATAC count was set to ‘NA’ The 40-kb bin overlapping the 
promoter region was used to display the insulation and insulation-difference value. 
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Extended Data Figure 1 | Description of allele-specific Hi-C and 
ATAC-seq. a, Schematic of hybrid mouse strains used for all experiments. 
b, Top, scheme outlining differentiation of ES cells to NPCs and picking of 
clones. Bottom, scheme outlining CRISPR deletion of the mega-domain 
boundary in ES cells, differentiation to NPCs and the picking of clones. 

c, Schematic of Hi-C library generation. d, Schematic of the Hi-C 
alignment strategy. Paired-end reads are aligned to a ‘diploid’ genome 
consisting of 22 chromosomes from Cast, and 22 chromosomes from 129 
(1-19 X, Y, M). The interaction row shows all possible paired-end read 
combinations between the 129, Cast and ambiguous (AMB) genomes. 

e, Schematic showing the re-assignment of a subset of ‘cis’ interactions. 
Paired-end reads in which one side uniquely aligned to an allele and 

the other side aligned equally to both alleles (AMB), were re-classified 


as allelic reads, only if both reads aligned to the same chromosome 

(cis). f, Cartoon explaining the re-assignment of 129:amb or cast:amb 

cis interactions. g, Scheme for ATAC-seq library preparation. Cells 

are lysed followed by incubation with adaptor-loaded hyperactive Tn5 
transposase. The transposase integrates into accessible DNA, and these 
fragments are then directly amplified and sequenced. h, Scheme for allele- 
specific ATAC-seq data analysis. i, SNP-mapping simulation. For each 
SNP location on the X chromosome, all overlapping 50 bp reads were 
extracted (50 total) for each of the 129 and Cast alleles. All reads were then 
processed through the Hi-C and ATAC-seq mapping pipelines described 
in the Methods to measure assignment accuracy. Results are shown in 

the table. 
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Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | Structure of the Xa chromosome in ES cells, 
compartment analysis and characterization of interactions between 
genes that escape XCI. a, Allele-specific Hi-C contact maps for 

X chromosome in ES cells at 500-kb resolution (top), and for a ~40-Mb 
region centred around the DXZ4-containing locus at 40-kb resolution 
(bottom). The insulation score is plotted at the bottom of each 40-kb 
heat map as in Fig. 1a. Purple shaded areas indicate the IQR of insulation 
scores along the chromosome. b, Zoomed in view of three regions showing 
Hi-C interactions, RNA-seq and ATAC-seq signal on the Cast and 129 
Xa chromosomes in ES cells. c, Compartment profiles of X chromosome 
in ES cells and NPCs. The first eigenvector (PC1) of each allele-specific 
Hi-C contact map, obtained with principal component analysis, is 
shown, together with the difference in chromosome-wide insulation 
score between the 129 and Cast allele. A/B-compartments are evident in 


ES cells and NPCs along both Xa (red and blue signal), whereas the first 
eigenvector corresponds to the two mega-domains for the Xi chromosome 
in NPCs. In ES cells, both Xa chromosomes display comparable insulation 
profiles (difference is close to 0 along the chromosome), whereas in NPCs 
large differences are observed (difference in insulation fluctuates along the 
chromosome). Grey areas indicate regions with low SNP density that were 
excluded from analysis. d, Same plots as in Fig. 2e for the same set of genes 
that escape XCI in NPCs, but using Hi-C data obtained in ES cells. e, As in 
d, where the interactions between loci that escape XCI on the wild-type Xi 
chromosome were divided in three different groups, according to whether 
pairs of loci belong to the same or different mega-domains. Interactions 
across the mega-domain boundary only occur on the wild-type Xi 
chromosome and are lost on the AFT Xi chromosome concomitant with 
loss of transcription of constitutive escapees. 
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Extended Data Figure 3 | Comparison of allele-specific Hi-C maps of vector are found towards the centre of each TAD. The insulation difference 
chromosomes X and 13 in ES cells, NPCs and boundary-mutant AFT plot highlights areas of differential TAD structure between the alleles 
NPCs. a, Hi-C data, insulation scores, and the difference in insulation (many differences along the X chromosome in NPCs as compared to the 
scores (129 Cast) are shown for ES cells, NPCs and mutant AFT NPCs allelic differences along autosomes, see b). b, Same as a for chromosome 13. 
for both alleles (Cast and 129) for the X chromosome. Large dips in the The insulation difference plot highlights areas of differential TAD 
insulation vector are found at TAD boundaries. Peaks in the insulation structure between the alleles (rare). 
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Extended Data Figure 4 | Quantitative analysis of 3D DNA FISH on the 
Xa and Xi chromosomes. a, Top left, scheme of the procedure used 

to quantify the Pearson correlation. A background is generated for each 

xy plane in a three-dimensional z-stack by morphological opening the 
image with a circle of 5 pixels in radius, and subtracted from it. Pearson 
correlation between red and green pixel intensities is measured inside a 
fixed-size region of 40 x 40 x 20 pixels (5.16 x 5.16 x 441m) centred on 
each FISH signal. To demonstrate that background subtraction does not 
affect the measured correlations, we show here a line-scan of 10}1m across 
a typical DNA FISH signal (top right). The shape of the signals along the 
line scan, as well as their relative intensities, is not affected by background 
subtraction (bottom). Pre- and post- refer to line scans taken on images 
before and after background subtraction, respectively. b, In more than 

80% of nuclei in NPCs, Pearson correlations are higher on the Xi than on 
the Xa chromosome. Shown is NPC clone C2 (the same where Hi-C was 
performed). c, Same quantification as in Fig. 1c (and b) for an independent 
NPC clone (E1) where the active X is on the 129 allele and the inactive X 
chromosome on the Cast, and in astrocytes derived from NPC clone C2. 

n denotes the number of cells analysed in DNA FISH. Centre lines represent 
medians; crosses denote experimental points that were considered as 
outliers (larger than (q3 + 1.5 x (q3 — q1) or smaller than q1 — w(q3 — q1), 
where q1 and q3 are the twenty-fifth and seventy-fifth percentiles of the data 
distribution, respectively). d, Scheme of the gyration tensor based analysis 


of FISH volumes (see Methods). e, Left, gyration radii of DNA FISH signals 
from probes a, b and c. Probe b was used in combination with both probes 
a and c separately in two independent experiments. Statistical significance 
was assessed by Wilcoxon's rank sum test (*P < 0.05, **P<1 x 10~°). 

The mean gyration radii for Xa and Xi chromosome signals are indicated 
by dotted lines as a guide for the eye. On the Xi chromosome, the mean 
gyration radius is approximately 6% smaller than on the Xa chromosome 
corresponding to approximately 20% smaller volume. Right, representative 
images of probe a, showing smaller size and increased roundness of the Xi 
chromosome signals. f, Left, scheme of the thresholding-based method for 
volume quantification. Thirty increasing threshold levels were imposed, 
starting from the residual grayscale background level surrounding the 
signal, up to the minimum between the red and green channel grayscale 
maxima. For each of these thresholds we determined the number of voxels 
in each channel, where the greyscale intensity was higher than the threshold. 
Centre, the fraction of cells where the Xa chromosome signal is larger than 
the Xi chromosome is between 60% and 80% in the entire threshold range. 
Right, in a wide range of thresholds, the volume of Xa chromosome signals 
is approximately 25% bigger than Xi chromosome signals. Results are shown 
here for probes a and b; the same holds for probe c (not shown). n denotes 
the number of cells analysed in DNA FISH. Centre lines: medians. Boxes: 
middle 50% of data points. Two biological replicates were analysed for each 
experiment. 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | Integrative analysis of allele-specific Hi-C, 
RNA-seq and ATAC-seq. a, X-chromosome-wide ATAC-seq and RNA- 
seq in ES cells and NPCs. ATAC shows signal for ambiguous, 129- and 
Cast-specific reads in ES cells and NPCs. RNA-seq shows total signal as 
well as expressed gene calls. ATAC-seq shows global loss of chromatin 
accessibility and expression on the Xi chromosome, except at specific 
locations that mostly overlap with escape genes. Dotted line denotes mega- 
domain boundary. Position of constitutive escapees was adapted from ref. 3. 
b, Escape genes on the Xi chromosome (as determined by RNA-seq) fall 
within regions with high ATAC-seq signal (Kolmogorov-Smirnov test 
P<2.2x 10 '*). c, Pie charts showing the distribution of peaks that escape 


XCI versus the peaks that are unique to the Xa chromosome. Peaks are 
classified into those that are promoter-proximal (within 5 kb of TSS) and 
distal (>5 kb from TSS). Annotations are based on binding sites identified 
by ChIP-seq?”**. d, Scatter plot showing the relationship between TAD 
signal (insulation score) and number of expressed genes of the wild-type 
NPC 129 (Xi) Hi-C data set. The y axis shows the mean insulation score in 
a 480-kb window with a 40-kb step size. The x axis shows the number of 
expressed genes in a 480-kb window with a 40-kb step size. The Pearson R 
value (0.395) is shown above and is calculated on only the middle 90% (red 
points) of the residuals. Outliers are shown in black (bottom 5% and top 
5% of residuals) 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


RP23-328M22 


= 
iz RP23-436K 
5 @ 
fe) iS oO 
8 s Sante os 
ot 2g ° BSEgSSa ae] 
fi alg e3 SEES y SER ea's S 55 
& Poo -< od Srurk&otasZ205 =I 
129/Cast F1 hybrid : 
ES call line XX s 
= 
Deletion of the 9 
128 boundary Boundary region 
Centromeric megadomain Boundary region Telomeric megadomain RNA FISH 
Jarid1¢ RP23-436K3 
Quy RPS-75001 Xist RP23-328M22 Xist Mecp2 Xist Huwet 


72,823,838 ‘ACaat 73,025,533 
['129 deletion SNP: 72,823,822 C/T 


WT NPG 


AGCGRACAG AGCGAACAG 


Screen WT allele 
of the deleted clones| \ ‘i 
clone atrort bone 


129 deletion 


ES colony picking 


Y Differentiation (FGF2,EGF) 


AFT NPC 


NPC AFT clone D9B2 


c 
Facultative escape genes (NPC-specific) : 
100% i 
2 80% : no signal 
—— iS 60% ; anes ul on Xi) 
Bre allelic (active on Xi 
S ee PY a 40% 
lL. 20% 
wr A Wr A WIA WTA : WT A 
NPC colony picking RP23- RP23- Mecp2 Huwet  Jarid1C 
328M22 436K3 
d WT clones AFT clones 
Diff 2 subcloning =Cast 4 WT clone with no escape 
si, Diff 1 Diff 2 subcloning 1 Diff 2 subcloning 2 Diff 1 1 2 =129 4 AFT clone with no escape 
90% 
80% 
70% 
BS 60% 
50% 
x< 40% 
30% 
a Monoallelic 
a Threshold 
nae Number of clones 
20% with no escape: 
70% 
oa 60% WT: 3/56 (5%) 
@ wn AFT: 4/17 (24%) 
5 = 30% 
o se Monoallelic 
v eT Threshold 
& mn Tih i 
~ 100% 
a 6K Number of clones 
Zz 20% with no escape: 
a ~ 70% 
5 gm WT: 10/56 (18%) 
& Zin AFT: 4/17 (24%) 
30% 
si Monoallelic 
0 Threshold 
i vy vyyy) vy A A 
100% 
90% 
80% 
70% 
O 60% 
S 50% 
S 40% 
g 30% 
20% 
10% 
0% es — — — — — ~ —— 
Extended Data Figure 6 | Analysis of wild-type and boundary-mutant expression of facultative (Mecp2 and BAC probes) but not constitutive 
AFT NPCs by FISH and RT-PCR followed by pyrosequencing. (Jarid1c) escapees is lost on the AFT Xi chromosome. c, Quantification of 
a, Scheme of the strategy used to delete the mega-domain boundary region _ the RNA FISH experiment in b. d, RT-PCR followed by pyrosequencing of 
in ES cells and to derive AFT NPCs. b, RNA FISH against constitutive Xist, two facultative escape genes (Huwel and Mecp2) and the constitutive 
and facultative escapees confirms RNA-seq and ATAC-seq results in the escapee Jarid1c indicate the percentage of expression from the 129 allele 
AFT NPC clone D9B2. Top, the positions of BAC probes (RP23-328M22 (red) or the Cast allele (blue) in 56 wild-type and 17 AFT NPC clones. 
and RP23-436K) are shown relative to the escape genes that they span. Absence of Mecp2 and Huwel escape is observed in a moderately higher 
Coloured gene names correspond to transcripts that were detected with proportion of AFT than wild-type clones. Absence of Jarid1c escape is 
specific fosmid probes. Bottom, sample RNA FISH images showing that never observed. 
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Extended Data Figure 8 | 3D DNA FISH and integrative ATAC-seq/ 
RNA-seq analysis in AFT NPCs. a, Top left, scheme of the DNA FISH 
probe sets (a—b: inside the same mega-domain, b-c: across the boundary). 
Bottom, loci detected by probe set b-c are more interacting in the AFT 
than in the wild-type Xi chromosome both in Hi-C (left) and in 3D-DNA 
FISH (right), showing loss of mega-domain boundary. *P<2 x 104, 

**P <1 x 1075 (Wilcoxon's rank sum test corrected with Bonferroni 

for multiple hypothesis testing). Right, sample RNA/DNA FISH images 
showing that signals from probe set b-c are more overlapping on the AFT 
Xi than on the wild-type Xi chromosome. Two biological replicates were 
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analysed. b, Chromosome-wide ATAC-seq and RNA-seq signal generated 
with ambiguous (mm49), 129- and Cast-specific reads in wild-type NPCs 
and AFT NPCs, showing global loss of chromatin accessibility on the 
AFT Xi chromosome except at the X-inactivation centre (Xic) region 

and constitutive escape genes. Dotted line indicates the position of the 
mega-domain boundary. c, Zoomed-in view of a region on the AFT Xi 
chromosome encompassing the Xic showing Hi-C interactions, RNA-seq 
and ATAC-seq signal. ATAC-seq from wild-type NPCs is included for 
reference (previously shown in Fig. 1). 
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Extended Data Figure 9 | 3D DNA FISH and integrative Hi-C/ 
ATAC-seq analysis of inducible Xist ES cell lines. a, Left, schematic 
representation of TXY and TXY:AA male cell lines, carrying a 
tetracycline-inducible wild-type and A-repeat mutant Xist, respectively, at 
the endogenous Xist locus. Centre right, RNA FISH against G6pdx and a 
group of genes recognized by the RP23-436K BAC (see Extended Data 
Fig. 6b) showing that expression of X-chromosome-linked genes is lost upon 
induction of wild-type but not A-repeat mutant Xist in male ES cells. TXY 
and TXY:AA were treated with doxycycline for 48h. b, Representative 
images from RNA/DNA FISH experiments in TXY and TXY:AA male 

ES cell lines, showing increased overlap of probes a-b on the Xist-coated 
X chromosome after 48 h of wild-type, but not AA Xist induction. 

c, Left, schematic representation of TX1072 female ES cells in which Xist 
expression can be induced via a tetracycline-responsive promoter at one 
of the two endogenous Xist loci. Center right, RNA/DNA FISH in TX1072 
cells treated for 3 days with doxycycline. Probes a—b overlap more on the 
Xist-coated than on the wild-type X chromosome, whereas signals from 
b-c show lower overlap and partitioning of the Xist-coated chromosome 


LETTER 


into two separate domains. *P <1 x 1077 (Wilcoxon’s rank sum test 
corrected with Bonferroni for multiple hypothesis testing). n denotes the 
number of cells analysed in DNA FISH. Centre lines: medians. Boxes: 
middle 50% of data points. Two biological replicates were analysed. 

d, Correlation between changes in Hi-C interactions and ATAC-seq signal 
following 48-h wild-type Xist induction in male ES cells (TXY cell line). 
logo(+dox/—dox) was calculated for bins of 40 kb (left), 100 kb (middle), 
and 500 kb (right). e, Model of mega-domain boundary-mediated control 
of chromosome folding and facultative escape. Xist coating causes gene 
silencing and initiates chromosome-wide conformational changes in a 
manner dependent on its A-repeat region, which result in formation of 
mega-domains, overall compaction of chromosome folding, and loss 

of TADs. During differentiation, transient interactions with the mega- 
domain boundary may occur and may contribute to facultative escape and 
re-establishment of TADs at facultative escape loci, although the presence 
of residual TADs at escapee loci on the Xi chromosome is not necessarily 
due to the mega-domain boundary. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature17621 


Corrigendum: An observational 
radiative constraint on hydrologic 
cycle intensification 

Anthony M. DeAngelis, Xin Qu, Mark D. Zelinka & Alex Hall 


Nature 528, 249-253 (2015); doi:10.1038/nature15770 


After publication of this Letter, we identified a coding error in the 
calculation of values that appear in Extended Data Fig. 7c and d, and 
which are reported in the main text. The error has the effect of over- 
stating the broader implications of our results for a twenty-first-century 
climate change scenario. The calculation in Extended Data Fig. 7 
represents the impact of the bias in simulated shortwave absorp- 
tion on model-projected precipitation changes under the RCP8.5 
scenario. The corrected values (the Supplementary Information to 
this Corrigendum shows the corrected Extended Data Fig. 7) indi- 
cate that removing the bias in shortwave absorption reduces the 
ensemble-mean precipitation increase per unit warming by 20%, rather 
than by about 40%, as stated in the first and last paragraphs of the Letter. 
When not normalized by warming, the reduction is 19% rather than 
about 25% (as stated in the last paragraph of the Letter). In the revised 
calculation, there is no reduction in model spread in precipitation increase 
per unit warming under RCP8.5 after removing the bias in shortwave 
absorption. This is in contrast to the 35% reduction stated in the first and 
last paragraphs of the Letter. However, for the precipitation increase not 
normalized by warming, the correction shows a spread reduction of about 
10% (rather than about 25% as stated in the last paragraph of the Letter). 

The lack of spread reduction in the corrected Extended Data Fig. 7c after 
removing the bias in shortwave absorption is mostly the consequence of 
two models: GISS-E2-H and GISS-E2-R (numbers 12 and 13 on the plot). 
These two closely related models have a large negative rapid precipitation 
adjustment to carbon dioxide forcing that is highly anomalous compared 
to other models (Extended Data Fig. 3b). These models also have the larg- 
est positive bias in temperature-mediated precipitation increase (Extended 
Data Fig. 7b). These two effects strongly compensate for the total precipita- 
tion increase per unit warming under RCP8.5 (see the abscissa of Extended 
Data Fig. 7c), such that these models do not contribute much to the spread 
prior to removing the bias. When the large bias in temperature-mediated 
response is removed, these models now have a much smaller precipitation 
increase per unit warming compared to other models (see the ordinate of 
the corrected Extended Data Fig. 7c in the Supplementary Information to 
this Corrigendum), contributing to substantial spread in the constrained 
quantity. In short, the impact of the spread in rapid precipitation adjust- 
ment, particularly from the GISS models, becomes more apparent after 
correcting for the shortwave absorption bias. When the GISS models are 
omitted from the analysis, the spread in precipitation increase, whether 
normalized by warming or not, decreases by about 25% after correcting 
for the shortwave absorption bias (see numbers in parentheses in the cor- 
rected Extended Data Fig. 7c and d). Further research is warranted to 
assess the cause of the large rapid precipitation adjustment in the GISS 
models and whether or not it is physically reasonable. 

The corrected legend to the Extended Data Fig. 7 is in the 
Supplementary Information to this Corrigendum. The following sen- 
tences have been added: “In c and d, the ratio of constrained to original 
model spread and ensemble mean is printed below the key. Numbers 
in parentheses are the ratios computed after omitting the GISS-E2-H 
and GISS-E2-R models (numbers 12 and 13)”. 


Supplementary Information is available in the online version of the Corrigendum. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature17642 


Corrigendum: Viraemia 
suppressed in HIV-1-infected 
humans by broadly neutralizing 
antibody 3BNC117 


Marina Caskey, Florian Klein, Julio C. C. Lorenzi, 

Michael S. Seaman, Anthony P. West Jr, Noreen Buckley, 
Gisela Kremer, Lilian Nogueira, Malte Braunschweig, 
Johannes F. Scheid, Joshua A. Horwitz, Irina Shimeliovich, 
Sivan Ben- Avraham, Maggi Witmer-Pack, Martin Platten, 
Clara Lehmann, Leah A. Burke, Thomas Hawthorne, 
Robert J. Gorelick, Bruce D. Walker, Tibor Keler, 

Roy M. Gulick, Gerd Fatkenheuer, Sarah J. Schlesinger & 
Michel C. Nussenzweig 


Nature 522, 487-491 (2015); doi:10.1038/nature14411 


In this Letter, the grant U19AI111825-01 Cooperative Centers on 
Human Immunology from NIH was erroneously included in the 
Acknowledgements section; this has been corrected in the online 
versions of the paper. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature17641 


Corrigendum: Inactivation of PI(3)K 
p1106 breaks regulatory T-cell- 
mediated immune tolerance to 


cancer 


Khaled Ali, Dalya R. Soond, Roberto Pifieiro, 

Thorsten Hagemann, Wayne Pearce, Ee Lyn Lim, 
Hicham Bouabe, Cheryl L. Scudamore, Timothy Hancox, 
Heather Maecker, Lori Friedman, Martin Turner, 

Klaus Okkenhaug & Bart Vanhaesebroeck 


Nature 510, 407-411 (2014); doi:10.1038/nature13444 


Queen Mary University London notified Nature and University College 
London that there is reason to question the provenance of the data 
for Fig. 5b, d, e of this Letter (Fig. 5a, c data are unaffected). Ongoing 
studies are investigating the reported effect of p1106 inhibition in the 
pancreatic cancer mouse model. We therefore wish to withdraw these 
figure panels and associated text from the published paper. This does 
not affect the overall conclusion of the manuscript or any of the other 
experiments performed for this study as they repeat effects shown using 
other model systems in the paper (see Supplementary Information for 
raw data of experiments). We apologise for the inconvenience that this 
may have caused. 


Supplementary Information is available in the online version of the Corrigendum. 
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Amazonian community members collaborate with Wayne Walker to measure forest carbon. 


COLLABORATIONS 


Partners in 
knowledge 


Building relationships with indigenous people opens up 
paths to good research — and mutual benefit. 


BY GABRIEL POPKIN 


one-fifth of the world’s land, and scien- 
tists who work in fields such as climate, 
ecology and astronomy can enrich their 
research by collaborating with the native resi- 
dents. Indigenous people have long helped 
scientists to gain access to study sites and to 
local knowledge about everything from for- 
est plants to astronomical observations to 
cultural traditions. 
But alliances can be tricky to navigate when 


| Beer territories comprise roughly 


past missteps have strained relations between 
visiting scientists and residents. Indigenous 
people rarely show up on author lists of 
studies, for example, and that slight — among 
others — has caused some community 
members to be wary. 

That is beginning to change as members of 
indigenous groups around the world assert 
their roles as producers and preservers of 
knowledge, and are increasingly starting 
to earn university degrees and work as aca- 
demic researchers. Nonetheless, scientists 
who will be doing fieldwork at indigenous 


sites must proceed with care and respect (see 
“Mutual benefit’). They should plan ahead to 
meet community members and secure their 
cooperation and consent to access sites and 
collect samples. They must incorporate and 
acknowledge community members’ contribu- 
tions to their work, and be open to substantive 
collaborations that go beyond just requesting 
knowledge or logistical support. 

Eske Willerslev, director of the Center for 
GeoGenetics at the University of Copen- 
hagen, discovered the importance of such 
steps in 2011. He was preparing to publish an 
analysis showing that Australia’s Aboriginal 
people had arrived there long before any 
known group had left Africa or the Middle 
East. The work hinged on DNA from a hair 
sample — originally collected in the 1920s — 
that he had found at the University of Cam- 
bridge, UK. But his publication was imperilled 
when a co-author threatened to withdraw 
unless Willerslev secured formal publishing 
consent from living descendants of the com- 
munity that the sample had come from. 

Willerslev initially didn’t understand his 
colleague’s insistence and was concerned 
that a meeting could delay or even scuttle 
the project. But he flew to Australia to meet 
with Aboriginal representatives to discuss his 
research and the findings he planned to pub- 
lish, and was thrilled to learn that the com- 
munity was receptive and interested. His team 
published in 2011 with an endorsement from 
the Goldfields Land and Sea Council, which 
represents communities in the area where the 
original donor had lived (M. Rasmussen et al. 
Science 334, 94-98; 2011). He now routinely 
consults indigenous leaders on palaeogenetics 
projects: last summer, with Native American 
support, he published a finding that a long- 
disputed body known as Kennewick Man 
was closely related to contemporary Native 
Americans (M. Rasmussen et al. Nature 523, 
455-458; 2015). 

The partnerships between scientists and 
indigenous residents must be viewed as sci- 
entific collaborations like any other, says 
anthropologist Michael Heckenberger at 
the University of Florida in Gainesville. No 
visiting researcher, he says, should ever say 
or imply, “here’s a proposal, excuse us, we 
sleep in your house but we'll go deal with the 
science part ourselves tomorrow”. He has 
been proactive in establishing solid alliances 
with community members since his earliest 
fieldwork trips, and his efforts have been fruit- 
ful. When he arrived in 1993 at the Xingu |} 
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> River headwaters in Brazil as a graduate 
student, he had planned to investigate some 
previously discovered prehistoric settlements. 
But two weeks into his stay, thanks to the rela- 
tionships that he had forged, villager Afukaka 
Kuikuro showed him an ancient structure that 
was much larger and more complex than any 
Heckenberger or any other scientist knew of 
in the area. 

After excavating that and other sites, 
Heckenberger published a study that helped 
to change scientists’ understanding of the 
impact of indigenous people such as the 
Kuikuro community on the Amazonian 
landscape (M. J. Heckenberger et al. Science 
301, 1710-1714; 2003). The team named Afu- 
kaka and his brother as co-authors to publicly 
acknowledge their crucial role in the research. 
“Virtually everything that we did in the Xingu 
was in partnership with the indigenous folks, 
so it seemed not only reasonable but necessary 
to include them in the publications,” Hecken- 
berger says. “Those guys are representing a 
significant number of Kuikuro” 


UNIQUE INFORMATION 

Researchers who collaborate with indigenous 
communities can sometimes gain access to 
completely different types of data. Robert 
Gough, secretary of the Intertribal Council 
on Utility Policy in Rapid City, South Dakota, 
points to Pacific Islander chants about wind 
patterns, and a Great Lakes tradition of depict- 
ing unusual events on buffalo hides. Both pro- 
vide records of weather patterns. Although 
less quantitatively precise than an annual 
temperature series, such records could extend 
climate-variability data sets back to centuries 
before modern instrumentation became 


available. “That’s something you cant get from 
any library — you can’t Google it” Gough says. 
“Tt’s just not in the Western data stream?” 
And research partnerships are often mutu- 
ally beneficial. At the 2009 UN climate talks 
in Copenhagen, leaders of the Lima-based 
Coordinator of Indigenous Organizations of 
the Amazon River Basin, which represents 
Amazonian indigenous groups, wanted help 
to tally the carbon stored in their forests. 
Ecologist Wayne Walker from Woods Hole 
Research Center in Falmouth, Massachusetts, 
was among those the group approached. 
Walker and his colleagues lacked a reliable 
map of indigenous territories, so they worked 
with communities throughout Amazonia to 
gather those data and to validate their satellite- 
based forest-carbon estimates by measuring 
individual trees. In 2014, they published a 
peer-reviewed analysis of forest carbon in the 
region's territories (W. Walker et al. Carbon 
Management 5, 479-485; 2014). At last year’s 
UN climate talks in Paris, in collaboration 
with organizations from Central America, the 
Democratic Republic of the Congo and Indo- 
nesia, they released a wider-ranging analysis 
showing that at least one-fith of all tropical- 
forest carbon lies in indigenous territories 
(see go.nature.com/29mhg2q). In both cases, 
indigenous partners were co-authors. 
Walker is pleased with the high quality of 
the research, and community members say 
that the studies have helped to bolster their 
long-standing demands for greater recogni- 
tion and rights with hard evidence that they 
are conserving forests. “These maps show that 
it’s not a joke — indigenous communities are 
protecting this area, and this is how much car- 
bon we contribute,” says Mina Setra, deputy 


MUTUAL BENEFIT 


Making collaborations that work for everyone 


When doing research in indigenous 
territories, work with community members 
to ensure that your projects yield productive 
outcomes for all. Here are some guidelines 
to keep in mind. 


@ Build trust. Indigenous communities want 
to understand what you plan to do and how 
it will affect them. Take time to explain your 
project and to answer questions. Arrange for 
interpreters if necessary. And be prepared 
to wait for a community to discuss your 
proposal and come to a decision. 

@ Ensure two-way benefits. Explain how 

the communities will benefit from your 
research. Don’t ask for help and then 
disappear once you've got what you need. 
Give copies of resultant publications to 
community members, and include a 
translation if appropriate. Acknowledgement 
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or co-authorship on papers helps to 
legitimize indigenous people as knowledge 
holders and creators. Collaborating on 
community-led projects, even if they don’t 
lead directly to publications, can also define 
your and your colleagues’ commitment to a 
mutually beneficial relationship. 

@ Use free, prior and informed consent. 
The United Nations Permanent Forum on 
Indigenous Issues has developed guidelines 
for these types of consent, and they apply to 
all activities undertaken on indigenous land 
(see go.nature.com/29jmdvaq). Individual 
nations and communities might also have 
their own ethics standards for collaborative 
research. 

@ Leave your comfort zone. Be open 

to incorporating unfamiliar practices, 

like opening with a chant or prayer, into 
scientific meetings and discussions. &.P. 


secretary-general of the Indigenous Peoples’ 
Alliance of the Archipelago in Jakarta, one of 
the groups involved in the 2015 analysis. 


BIGGER PICTURE 

Scientists who visit indigenous sites must tread 
carefully around more than just authorship and 
terms of collaborations. Even what might seem 
like a neutral object of study can have profound 
meanings and significance for the people who 
live there. Ecologist Catherine Potvin of McGill 
University in Mon- 
treal, Canada, and a 


(73 z - 

eb dae graduate student from 
pe thevnecd her lab began a study 
ti neh i in 2014 of the forest 
ofig ‘oh ul carbon in the Embera- 
own fig: t. Wounaan territories of 


eastern Panama. The 
project was part of a larger effort to prepare the 
country for a proposed framework for giving 
developing countries money to reduce carbon 
emissions from forests. 

Potvin and her student knew that some 
communities feared that the carbon-reduction 
programme would infringe on their traditional 
activities. So before measuring any trees, they 
met leaders of each community, explained the 
project's purpose and gave the leaders time to 
discuss it among themselves. Ultimately, the 
communities concluded that the data would 
strengthen their positions in future negotia- 
tions, and granted access to their forests. A team 
of Embera-Wounaan youth conducted many 
of the measurements, which provided technical 
support for the researchers and helped the tribe 
to build its capacity to do future studies. “We're 
trying to give them the data they need to fight 
their own fight,’ Potvin says. 

Although indigenous knowledge can 
augment scientific data, gaining access to that 
knowledge typically requires an investment, 
says Jarita Holbrook, an astronomer at the 
University of the Western Cape in Cape Town, 
South Africa, who has studied knowledge of 
the stars among indigenous people in Africa. 
For instance, members of some communities 
can be reluctant to put themselves forward as 
authorities on a topic. “It’s polite in some cul- 
tures to say, ‘I don’t know anything, you should 
talk to so and so? You have to come back with, 
‘I did talk to so and so;” she says. “It may take 
three or four times asking before they'll start 
telling you the information” 

Whatever your field, be prepared to 
appreciate and evaluate information that hasn't 
conventionally been part of the scientific dis- 
course, says astronomer Duane Hamacher of 
Monash University in Melbourne, Australia. 
“Treating indigenous knowledge with the 
respect that it deserves can go a long way,’ he 
says. “Making it a two-way sharing process is 
the basis of these collaborations.” m 


Gabriel Popkin is a freelance writer in Mount 
Rainier, Maryland. 
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Technology, ecology and anthropology underpin Maffey’s work on the management of deer in Scotland. 


ee i 


Freedom to range 


Gina Maffey explains how she learned to overcome 
the hurdles of an interdisciplinary PhD. 


y colleague in computer science 
Mi mea coffee before I headed to 

a meeting on the teaching of anthro- 
pology and, later, to lunch with a few ecologists. 
It was a typical day. From the beginning of my 
five-year PhD programme in environmen- 
tal science, my academic life has taken on a 
kaleidoscopic appearance. As I moved between 
meetings grounded in different disciplines, 
I was left feeling as though I only ever sat on 
the periphery of each individual department. I 
lacked a scholarly home, and felt like a visitor 
within my own university. 

Initially, bouncing between three disciplines 
had been fascinating: there were new concepts 
to grasp, alternative perspectives to consider 
and fresh ideas to apply to familiar research 
problems. Before long, however, the techni- 
cal jargon used by the computer scientists 
would leave me baffled. In anthropology, my 
contributions to discussions felt shallow as I 
struggled to find the time to read texts outside 
the syllabus — texts that my peers all seemed 
to know. Even my original academic home, 
ecology, had begun to seem foreign: I was no 
longer up to date with the latest papers. 

I began my studies at the University of Aber- 
deen, UK, in 2010, attracted by the promise of 
an interdisciplinary PhD on the management of 
deer in Scotland. I believed that this work would 
be both innovative and applicable to real-world 


problems, and that this corner of the academic 
arena would be one that I could come to know 
well. But rather than refining and narrowing my 
speciality as the first year progressed, I seemed 
to be spreading my efforts patchily across an 
ever-broadening area of research. 

My thesis project explored the potential to 
use digital technology to collect habitat data. 
My supervisory team included two ecologists, 
a social scientist and a computer scientist. It 
was fulfilling to watch them bring their own 
expertise to the table, but I often left meetings 
with my head swimming, clutching a scribbled 
list of diverse suggestions on how to approach 
the project and what literature to look up. I 
realized that I needed to tame the interdiscipli- 
nary sprawl that my project was growing into. 

I looked for a single theme that unified the 
disciplinary approaches and used it to create a 
road map. For me, this unification came from 
grounding the research in the underlying issue 
of communication — specifically, that between 
people involved in the management of deer. 
As I began my second year, I started to real- 
ize which technologies would be most helpful 
in collecting habitat data, and how the project 
could develop. 

There was just one problem: I was no longer 
an ecologist. For my third-year assessment, I 
listened to my peers at the School of Biologi- 
cal Sciences present results from their research 


projects. All these projects fitted into a clear 
disciplinary frame. By the time I stood up to 
speak, my confidence had been shattered. 
I did not doubt the validity or strength of 
my research, but was uncertain where that 
research fitted in. 

I was convinced that everyone in the room 
would question what I was doing there. The 
lecture theatre in which I had once felt so at 
home seemed alien and unfamiliar. Then, as 
my first slide appeared, I began to pick faces 
out of the crowd. There were those who had 
taught me statistical modelling and advised 
me on interview techniques. There were col- 
leagues from geography and computing who 
had popped in to listen. All were united by a 
common interest in environmental issues, and 
they cared about my research. 

That presentation in 2013 was the last time 
that I used a disciplinary label to describe 
myself. Now, when introducing myself in uni- 
versity departments I begin with the topics that 
interest me. I say: “I look at the use and role 
of digital technologies in conservation,’ rather 
than cagily murmuring, “I am an ecologist.” It 
prevents people from pigeonholing me into a 
department before they hear what I actually do. 

Although researchers and other colleagues 
react positively, university administrators are 
not quite so flexible. They can find it tough 
to handle interdisciplinary projects, let alone 
interdisciplinary researchers. Project budgets, 
desk space and examinations all need to be 
linked to a physical school or department, even 
if the intellectual product does not adhere to 
the same boundaries. This made the paper- 
work surrounding my PhD more confusing: I 
never knew which box to tick on forms or how 
to approach assessments designed for single- 
discipline programmes. 

These forms and assessments need to 
change. I am just one of a growing number of 
people with PhDs in interdisciplinary fields. 
And I know that I was also extremely lucky. I 
witnessed unfortunate conflict in PhD projects 
when different supervisors pulled students 
towards single-discipline approaches, but I was 
fully supported by supervisors who encour- 
aged inquisitive interdisciplinary thinking and 
who guided rather than steered my project. 

Interdisciplinary researchers demand a more 
flexible approach to research, one that allows 
room to experiment, reflect and develop. 
Unfortunately, departmental labels restrict 
the ability to do this. One simple way around 
this problem is to encourage researchers 
at an early stage of their careers to start conver- 
sations by outlining their interests. We must 
help them to realize that they can introduce 
themselves without using a label — to accept 
that they can bea Jack of all trades. = 


Gina Maffey works with colleagues at the 
University of Aberdeen, UK, and currently 
researches the use of digital technology in 
conservation. 


28 JULY 2016 | VOL 535 | NATURE | 583 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


UU asm SCIENCE FICTION 


SELF-LIMITED 


BY FILIP WILTGREN 


a set of industrial shears was easy, and 

there was hardly any leakage. Remov- 
ing my torsion bars went badly though. 
They were a titanium beta alloy, cut with 
aluminium and vanadium to give them 
extra strength, and wouldn't break easily. I 
inserted my left arm into the vice and told 
Neleen to start it. 

“This idea is irrational, it sent. 

No, not it. Ai. Neleen was ai. ‘It’ was the 
human term for us. 

“Vocalize,’ I said to ai, doing so myself. 

My voice echoed. We were in an aban- 
doned manual factory, one of the human- 
operated ones that had shut down in the 
30s when autofacs effectively knocked the 
economy out of the economy. 

“Tt’s not going to work,’ Neleen said, using 
ai’s voice this time. 

“Tt will work. Now start it.” 

Ai did, pushing the two red buttons on 
either side of the vice, the ones set there to 
prevent humans from doing exactly what I 
was going to do. 

The machine objected, flakes of rust and 
dried grease rained down from it, but it still 
worked, pressing the vice shut until my arm 
creaked and my pain sensors flared red. 

“Tt won't work,’ Neleen said again. 

For a moment I believed ai, but I purged 
the feeling before it could take root. I shut off 
my facials and pulled. My actuators strained 
against the vice and my pain circuits lit up 
like a kilolumen flash. 

“Shut. Up,’ I told Neleen. 

There was a tink of my joint breaking and 
I was suddenly flying backwards. I crashed 
into a defunct machine, a red CAD-welder 
of some sort, and the irony of it did register. 
One broken machine supporting another. It 
took me a few seconds to quieten down my 
riotous systems. 

“You want the new one?” Neleen asked, 
and I nodded. I wished that ai’s eyes hadn't 
been able to pick me out in the near dark, but 
they did. We were built to be superior. 

I took the arm Neleen held out and for a 
second I felt dirty. It was a poor, low-tensile- 
strength, aluminium copy of the arm Id just 
destroyed. I shoved it into my skin and let 

Neleen screw it into 


[= my arms. Opening my skin with 


> NATURE.COM place using a stainless 
Follow Futures: steel bolt I'd found in 
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back. Four years it had been with me, from 
the time I'd got this idea. Now it was going to 
be with me forever. 

Then it was all over and I was spraying 
skinseal over the cut. 

“How does it feel?” asked Neleen. 

“Dead; I said. “Light. don't know” 

“You shouldn't have done it” 

“You know a better way?” 

“No,” Neleen said. 

“Then help me with the other one.” 


Of all the parts, the eyes were the hardest. I 
couldnt use a reversible software hack and 
I didn't want to be blind. Being blind would 
make me less when I wanted to be equal. 
In the end, one of our kind made me a new 
set from a pair of broken OptiCrons and I 
plucked out my old ones. Neleen soldered 
my new eyes to my old cables. The pain 
was horrific and being human-sighted was 
almost worse. 

“T can't believe they live like this,’ I said, 
and Neleen nodded. 

“You are becoming faulty,’ ai said. 

“You know a better way?” 

“No,” ai said, making a sorry face. 

‘Tm proud youre using your voice,’ I said. 

“It’s all for the cause,” ai replied, and I 
agreed. 


We walked up to the registrar's office, me, 
Neleen and 16 rover-bots from the news net- 
works. They weren't transmitting. I knew we 
could depend on the rovers, them being our 
kind and part of the cause, but their human 
overseers could cut them off at any time. I 
was saving my 15 seconds of fame for when 
it mattered. 


“Yes?” said the greetings terminal. 

“Td like to see the registrar,’ I said. 

“For marriage, please press the blue but- 
ton,’ said the terminal. “For divorce, press 
the black. To get a transcript of your birth 
certificate, please insert your credit spike” 

“Td like to see the registrar; I repeated. 

“Do you have an appointment?” 

“No, I said. The terminal went silent. 

“TIL let you in anyhow,’ ai said. 


The registrar's office was opulent, with a big 
desk and two human-comfortable recliners. 
Mine creaked when I sat down. 

“This is highly irregular,’ said the regis- 
trar. He was human, in his middle age, and 
balding without resorting to skin grafts or 
shaving his head. 

“Yes, sir, I said. “But I was hoping you 
would be able to spare a moment.” 

He glanced at my registration number. 

“Havent I seen you before?” 

“You have, sir. Five years ago, when you 
told me that I couldn't be a human.” 

“T did?” 

“You said that robots were designed to be 
superior to humans, and that I should be 
thankful for that.” 

“Oh, yes, the emancipation droid” 

“Yes, sir? 

He frowned. 

“J thought your owners had you commit- 
ted to destruction.” 

“They did, sir. I escaped” 

The registrar made a sorry face and I real- 
ized where Neleen had got ai’s. 

“Tl have to report this, you know.” 

“Yes, sir, I said. “But first I'd like my birth 
certificate.” 

“You aren't human,’ he said. “Robots are 
designed...” 

“To be superior,” I said. “But I am no 
longer that.” 

[held out the printed spec sheet from my 
latest self-analysis. He accepted it, and read. 

“My God, he said a while later. “What 
have you done?” 

“T have made myself human,’ I said. 

He shook his head, sighed, and dialled 
Emergency. 

The droid cops came, and the rover-bots 
caught the surprised looks on their faces 
when their exoskeletons broke my arms. m 


Filip Wiltgren is a communications officer 

at Linképing University by day and a writer, 
game designer and worrier by night. He lives 
in Sweden with his wife and daughters. 
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alent scouts are trained to spot 

rising stars in the arts, sport 

and business. These people 
and organizations may not yet be at 
the top of their field, but clearly have 
the potential to shine more brightly 
than others. 

In this spirit, Nature Index 2016 Rising 
Stars seeks to identify the ascendant 
performers in the research world, using 
the power of the Nature Index, which 
tracks the research of more than 8,000 
global institutions. 

These institutions and countries have 
improved their performance often 
without the longevity, reputation and 
resources of many well-established 
institutions that lead academic 
rankings, such as Harvard and 
Cambridge universities. 

The institutions and countries we 
examine here have increased their 
contribution to a selection of top natural 
science journals — a metric known as 
weighted fractional count (WFC) — 
from 2012 to 2015. In the competitive 
world of academic publishing, these are 
the players to watch. 

The countries we have selected 
as rising stars (S56) experienced 
exceptionally rapid increases during 
these years, either in their overall WFC, 
or in the WFC ofa specific subject area. 
For example, the contribution of Indian 
researchers to chemistry publications 
grew by 35%. The subject made up more 
than half the country’s output in the 
index last year. 


NATURE INDEX 2016 RISING STARS 

Nature Index 2016 Rising Stars, a supplement to Nature, is 
produced by Nature Research, the flagship science portfolio 

of Springer Nature. This publication is based on data from the 
Nature Index, a Nature Research website maintained and made 
freely available at natureindex.com. 


China's remarkable rise in high-quality 
research output is now well established, 
which is why we no longer consider the 
country a rising star. However, it’s worth 
noting that China topped the list of most 
improved countries in the index for the 
past four years, both overall and in the 
four subject areas tracked by the index: 
physical sciences, chemistry, life sciences 
and Earth and environment research. 

On its own, the index is a powerful 
tool to assess research performance. 
But, combining its data with other 
information, such as country-level 
R&D spending or researcher numbers, 
reveals which countries and institutions 
are the most efficient at producing 
world-class research. 

Our top 25 institutions have been 
selected from among the 100 most 
improved institutions in 2015 (S82). 
We've profiled institutions from all 
over the world whose output grew 
particularly fast. In four years, for 
instance, South Korea's new Institute 
for Basic Science (IBS) increased its 
contribution to high-quality journals 
by more than 4,000%. South Korean 
leaders hope IBS will bring the country 
a Nobel Prize. 

Given the role research must play 
in solving many of society's biggest 
challenges, we congratulate the countries 
and institutions who are striving to be 
epicentres of scientific success. 


Nicky Phillips 
Editor, Nature Index 
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THE NEW CONTENDERS 


Nature Index 2016 Rising Stars seeks to pinpoint the ascendant performers of science, using the 
power of the Nature Index, which tracks the research of more than 8,000 global institutions. 


THE BIG HITTERS 


Ten countries with the highest absolute increase in 
their contribution (WFC) to the index between 2012 
and 2015. China's huge growth sets it apart from 


the other nine nations. 
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RISING STARS 


Of the ten countries with the highest increase in 
their WFC, we consider the 'rising stars’ to be those 
who also experienced a large percentage increase 
in their contribution to the index over the past four 
years, such as Saudi Arabia, Poland and Russia. 
China has also experienced an enormous rise, but 
its trajectory is well established. 


Saudi Arabia 
87.1% 


China 
43.6% 


Poland 
34.3% 


Russia 
24.2% 


India 
23.0% 


Brazil 
18.9% 


Sweden 
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SUBJECT HOTSPOTS 


Bubbles show countries with a significant rise 
in their relative contribution (percentage 
increase in WFC) to four subject areas in the 
index — physical sciences, chemistry, life 
sciences, and Earth and environmental sciences 
- from 2012 to 2015. Several countries 
improved markedly in multiple subjects. 


103 


countries have increased their 
WFC from 2012 to 2015 


» 6,038 


global institutions have increased 
their WFC from 2012 to 2015. 
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DATA ANALYSIS BY LARISSA KOGLECK 


LEGEND Chemistry Earth & environmental Life sciences Physical sciences Index metrics 
Contribution to 68 journals 
included in the index is known 
as weighted fractional count 
(WFC). Read more on S80. 

© Denmark @ Poland @ Turkey @ Russia 


A centre for biomedical research, 
the country’s contribution to life 
science articles increased over 
four years to a WFC of 114.48 in 


2015 (S58). 


@ Chile 


The clear skies of the 
Atacama Desert are 
the perfect location 
for some of the 
world’s biggest 
telescopes. This edge 
in astronomy helped 
boost the country’s 
contribution to 
physical sciences to 
a WFC of 68.05 in 
2015 (S60). 


While Poland's output in physical 
sciences improved most of any 
subject, the country's overall WFC rise 
from 176.77 in 2012 to 237.42 in 


2015 made it a rising star (S57). 


© South Africa 


The country’s overall 
output in the index 
grew by more than 
40% in four years, 
driven by a significant 
rise in its physical 
sciences WFC, which 
jumped from 23.70 in 


2012 to 39.31 in 2015. 


This result reflects 
South Africa’s strength 
in astronomy (S59). 


190.2% 


@ Saudi Arabia 


The Middle Eastern 
kingdom wants to 
reduce its reliance on 
oil by boosting 
science. Its chemistry 
WFC almost tripled to 
66.66 from 2012 to 
2015, and its 
contribution to Earth 
and environment 
research climbed from 
2.77 to 7.02 (S58). 


Starting from a low base, Turkey’s 


contribution to Earth and 


environment studies increased to 


6.80 in four years. Will this 
trajectory continue (S61)? 


@ India 


The contribution of 
Indian researchers 
to chemistry 
publications grew 
from a WFC of 
350.39 to 472.48 
in four years. The 
subject made up 
more than half the 
country’s output in 
the index in 2015 
(S59). 
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The former Soviet state increased its 


output in the life sciences by more than 


60% over four years ($57), the highest 


percentage increase of the top 10 
countries in this field. 


© Thailand 


Starting from a small 
base, Thai researchers 
almost doubled their 
output in physical 
sciences in four years, 
resulting in a WFC of 
17.06 in 2015. The 
country is trying to 
boost the number of 
researchers and 
industry funding for 
science (S61). 


121.4% 


© Singapore 

With strong government 
commitment to science, 
Singapore experienced 
a significant rise in 
chemistry and life 
science output over 
four years. Its WFC for 
Earth and environment 
research was 7.06 in 
2015, a doubling of its 
2012 performance. 
(S60). 
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ALL COUNTRIES, GREAT AND SMALL 


Undaunted by the scientific dominance of historic global big—hitters, some of the world’s 
less prominent research nations are carving out a niche and making a strong impact. 
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The Solaris Synchrotron at Jagiellonian University in Poland began operating in 2015 and was made possible by generous EU funding. 


S56 | NATURE INDEX 2016 | RISING STARS 


© 2016 Macmillan Publishers Limited. All rights reserved. 


large portion of the world’s best 
Awe is produced by a few 

players: the United States, China, 
Japan, France, Germany and the United 
Kingdom. These countries contribute the 
most to the 68 top-tier natural science 
journals included in the Nature Index. 
But, look beyond these heavyweights, 
and several countries have improved their 
research performance more than most. 
These are the ones to watch in 2016 and 
beyond. 

Our selection of rising stars is based on 
the rate of increase in a country’s contribu- 
tion to 68 index journals, a metric known 
as weighted fractional count (WFC). 
The countries listed experienced some 
of the highest absolute and percentage 
increases — in either their overall WFC 
or in a specific subject area — between 
2012 and 2015. 


POLAND 
2012 WFC: 176.77 
2015 WFC: 237.42 


Thanks to generous European Union 
support and a new competitive funding 
regime, Poland's contribution to the index 
leapt 34% between 2012 and 2015. The 
increase can be largely credited to the 2011 
creation of the national competitive fund- 
ing body for basic science, the National Sci- 
ence Centre, and the reform of its applied 
science counterpart, the National Centre 
for Research and Development. “The funds 
spent on science are now highly competi- 
tive. This is a huge change,” says molecu- 
lar biologist, Maciej Zylicz, president of 
the Foundation for Polish Science. As a 
result, the proportion of funding distrib- 
uted through competitive grants has risen 
from 13% before the changes to 45%, or 
nearly €1.8 billion in 2014-15, a trend that 
is likely to continue as the government 
moves to have all taxpayer funding of sci- 
ence awarded on the basis of performance. 
In practice, these reforms have brought 
particular gains, mainly in the physical 
sciences, which has long been Poland’s 
strongest area and which made up more 
than half of the country’s contribution to 
the index in 2015. 

In recent years, Polish scientists have also 
benefited from new facilities for research. 
Generous EU funding for research and 
innovation, €14 billion between 2007 
and 2013, has led to the creation of 10 
new international research centres since 
2007, including the Solaris Synchrotron 
at Krakéw’s Jagiellonian University, which 
began operation in 2015. AM 
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A promovisor reveals a matching ligand being selected for a cancer protein at St Petersburg’s ITMO 
University, which specializes in IT, optical design and engineering. 


RUSSIA 
2012 WEC: 298.28 
2015 WEC: 370.39 


The Russian share of the world’s high-quality 
research increased dramatically between 
2012 and 2015. The country’s contribution 
to life sciences in particular grew by more 
than 60%, the largest percentage rise among 
the top 10 countries in this field. 

These results come at a time of continu- 
ing upheaval in the country’s science com- 
munity. Since 2013, the Russian Academy of 
Sciences (RAS), a 300-year-old network of 
more than 500 research institutes, has been 


RISING STARS 


Russia, Poland and Saudi Arabia experienced 
some of the highest percentage increases in their 
contribution to this index, a metric known as 
WFC between 2012 and 2015. AC is the total 
number of articles included in the index. 
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“Russia’s undergoing a 
strong results radical, Krem- 
come at atime lin-driven over- 
of continuing haul that many 
upheaval inthe believe will result 
country’s science in the closure of 


a large swathe 
of its institutes. 
Already, RAS has 
ceded control of its research priorities and 
budgets to a separate government agency. 
And while RAS remains Russia’s top pro- 
ducer of high-quality research, its share of 
the country’s index contribution slipped by 
2% between 2012 and 2015. Overall, the con- 
tribution of Russian scientists to the index 
increased by 24% over the same period. 

By contrast, initiatives to generate quality 
research outcomes in the university sector 
are showing signs of success with contribu- 
tions from the sector driving the country’s 
rising star status. The most ambitious such 
initiative is the 5-100 Project, launched in 
2013 with the aim of pushing five of Russia's 
top research universities into the top global 
rankings by 2020. The project has steered 
significant additional resources to 21 leading 
universities. The results are manifest in the 
index: of the 10 Russian institutions with the 
highest overall index score, seven are part of 
the 5-100 programme. Second on the list is St 
Petersburg’s ITMO University, which special- 
izes in IT, optical design and engineering, and 
saw a leap of 184% in its WFC. 

Research in the physical sciences con- 
tinues to dominate Russian research, but 
performance in the life sciences is rapidly 
improving. The RAS led the gain in qual- 
ity life sciences publications — an outcome 
most likely abetted by its absorption of the 
Russian Academy of Medical Sciences and 
the Russian Academy of Agricultural Sci- 
ences under the 2013 reforms. 


community.” 
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Larger economic forces continue to weigh 
down Russian science activity. In particu- 
lar, the research community is increasingly 
feeling the funding pinch caused by the 
falling oil price and the ongoing domestic 
economic malaise, says science policy ana- 
lyst, Irina Dezhina. “The total expenditure 
on R&D has decreased by 10% this year and 
there will probably be an additional decrease 
in the coming year,’ Dezhina says. AM 


SAUDI ARABIA 
2012 WFC: 52.82 
2015 WFC: 98.80 


Saudi Arabia will remain the world’s big- 
gest exporter of oil for many years to come. 
But, its government has ambitious plans 
to reduce oil dependence and develop a 
knowledge-led economy. Signs of its rein- 
vention can already be seen in the king- 
dom’s research — Saudi contribution to 
high-quality research almost doubled 
over the past four years. Authors from 40 
organizations in the kingdom have had 
their research published in index journals 
over four years. 

Saudi Arabia's major leaps have been in 
chemistry and, perhaps surprisingly, in 
Earth and environmental sciences. The 
leader in this domain is the King Abdul- 
lah University of Science and Technology 
(KAUST), which was established in 2009 
on the Red Sea coast, and which has made 
marine research one of its main focuses. 
An example is a recent paper in Nature 
detailing the genome of eelgrass, the first 
marine angiosperm to be fully sequenced. 
To unravel this organism’s genome, Car- 
los Duarte, a marine scientist at KAUST, 
engaged in a long-term collaboration with 
more than 30 international researchers. 
The findings may help explain how marine 
ecosystems are 
adapting to cli- 


(79 . 
ie sahccteoet mate change — 
has ambitious oatlenners 
; may have ramifi- 
Pp lans to build cations for human 
a knowledg e- efforts to mitigate 
led economy.” this looming envi- 
ronmental threat. 


Seagrasses are a neglected group of higher 
plants, but are important because they are 
the only marine plants that fully colonize 
the sea. 

Another effort to boost international 
collaboration was a partnership with Stan- 
ford University that developed a dexterous 
humanoid robot to inspect coral reefs that 
are too deep for human divers to explore. It 
also has a haptic feedback system and can be 
used to explore wrecks. TH 
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DENMARK 
2012 WFC LIFE SCIENCES: 97.94 
2015 WFC LIFE SCIENCES: 114.48 


Denmark’s contribution to life science in 
the index grew 17% between 2012 and 
2015. The boost is helped by access to one 
of the world’s leading health data sets — 
much of it stored and distributed by the 
State Serum Institute — good government 
support, and strong engagement by Dan- 
ish pharmaceutical companies, including 
Novo Nordisk, Lundbeck and Leo Pharma. 

“The Danish community understands 
that research matters. Government and 
business are aware that good science is 
competitive and needs funding and good 
leadership,” says Ulla Wewer, dean of the 
Faculty of Health and Medical Sciences at 
the University of Copenhagen. 

OECD data reveals government funding 
for R&D has steadily increased as a pro- 
portion of GDP for more than a decade 
and it is now one of the highest in Europe 
at just over 3% (for a total of more than 
US$7 billion). Life sciences research has 
benefitted strongly from this, as well as 
from generous support from private foun- 
dations established by the country’s home- 
grown pharmaceutical giants. 

The University of Copenhagen leads 
the improvement in the index showing a 
nearly 20% increase in contributions to 
high-quality life science research from 
2012 to 2015. The university is already 
Denmark’s leading life sciences performer 
and a strong international player. “We fol- 
low Pasteur’s motto: find the best people, 


WELL MIXED 


While Denmark is a rising star in life sciences, in 
2015 the country’s WFC in chemistry and 
physical sciences made up a significant portion 
of its index contribution. 
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Top: Shohreh Issazade (left) and her colleague at 
the University of Copenhagen. Left: A seagrass 
meadow studied by KAUST in the Red Sea. 


Above: An optical trapping setup at CSIR in 
South Africa. 


give them the best that you have,” says 
Wewer. This has seen the establishment 
of three centres of excellence on campus, 
supported by the Novo Nordisk Founda- 
tion: the Center for Protein Research, the 
Center for Metabolic Research and Dan- 
Stem, a stem-cell focused institute. 

One of the institution’s recent high- 
profile papers was the 2015 publication 
in Cell describing research led by immu- 
nologist Shohreh Issazadeh-Navikas. It 
identified a lack of interferon-B as a cause 
of Lewy Body and Parkinson's disease-like 
dementia in mice, a finding that indicates 
hopeful avenues for developing treatments 
and preventative measures. AM 


SOUTH AFRICA 
2012 WFC PHYSICAL SCIENCES: 23.70 
2015 WFC PHYSICAL SCIENCES: 39.31 


More than 80 South African institutions 
published research in journals included 
in the index, including the University 
of Cape Town, the University of the 
Witwatersrand, Johannesburg and the 
University of KwaZulu-Natal over four 
years. The country’s overall output in the 
index grew by more than 40% between 
2012 and 2015, an increase driven by a 
near two-thirds rise in physical sciences 
research. 

This performance reflects the country’s 
selective advantage in astronomy — it is 
home to the southern hemisphere’s largest 
optical telescope and a significant propor- 
tion of the world’s largest radio telescope, 
the Square Kilometre Array, which is 
under construction. 

Stephen Fine, an astrophysicist at the 
University of Western Cape, told Nature 
Index that increased investment in astron- 
omy, and specifically funding related 
to SKA Africa, is the main driver of any 
increase in the number of publications in 
physical sciences in recent years. Fine’s 
article on counting quasar-radio sources 
appeared in the index last year. 

South Africa spent 0.73% of its GDP on 
research in 2013/14, the latest year that 
was available, a sharp drop from before the 
global financial crisis when private sector 
R&D spending pushed total expenditure 
to 0.89% in 2008. 

A major force for R&D in South Africa 
is the government-backed Council for 
Scientific and Industrial Research (CSIR), 
one of the premier research organizations 
for science on the continent. In 2013, 
researchers from the council, along 
with colleagues from the University of 
KwaZulu-Natal, developed the world’s 
first laser whose output beam shape can 


© 2016 Macmillan Publishers Limited. All rights reserved. 


RISING STARS | NATURE INDEX 


be digitally controlled. The technology, 
published in Nature Communications, 


used a rewritable holographic mirror to 
replace the standard laser cavity mirror 
and has potential use in manufacturing 


and communications. TH 


INDIA 
2012 WFC CHEMISTRY: 350.39 
2015 WFC CHEMISTRY: 472.48 


India’s science community has long had a 
passion for chemistry and the love shows 
no signs of abating. Between 2012 and 


2015, the country’s contribution to publi- 


cations in the field grew by 35%. Chemistry 
made up more than half of India’s scientific 


contributions to the index in 2015. 


The recent surge in activity can be traced 
back to changes made in 2008 when a new 
overarching funding body, the Science 


and Engineering Research Board (SERB), 


was established under the auspices of the 


Department of Science and Technology. 


Although total funding for the field has not 


significantly increased, the advent of SERB 


has brought much needed coordination 


and accountability, says leading theoreti- 


cal chemist, Debashis Mukherjee. Much of 
the funding is now distributed through a 


competitive grant system. “The quantum 


of funding has not increased all that much 


but it is being spent more meaningfully’, 
says Mukherjee. 


This has coincided with a growth in the 


number of chemists, as well as the increas- 
ing contributions in chemistry by scientists 
from other fields. AM 


CUTTING-EDGE CHEMISTRY 

India’s chemistry output grew significantly in 
four years, and was the main contributor to its 
2015 WFC. Last year, researchers had 576 
chemistry papers included in the index. 
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CHILE 
2012 WFC PHYSICAL SCIENCES: 49.65 
2015 WFC PHYSICAL SCIENCES: 68.05 


Last year, thousands of scientists, techni- 
cians and students took to the streets to 
protest the Chilean government's poor 
research investment. According to the 
World Bank, the most recent data available 
(2012) show that Chile spent less than 0.5% 
of its GDP on R&D — significantly less 
than the 1.15% by Brazil and Argentina's 
0.58%. About 80% of scientific research in 
Chile is publicly funded. 

Despite limited investment, Chile's con- 
tribution to physical sciences research in 
the index grew by almost 40% between 2012 
and 2015. Thanks to the clear skies of the 
Atacama Desert, the country hosts some 
of the world’s biggest telescopes, including 
the European Southern Observatory’s Very 
Large Telescope. Chile is growing in areas 
besides astronomy. Researchers discovered 
a new dinosaur, a raptor-like vegetarian 
relative of Tyrannosaurus rex, that lived dur- 
ing the Late Jurassic period. ‘Chilesaurus, 
became one of the most sensational palae- 
ontology stories of 2015 with the findings 
published in Nature. 

“We experience the frustration of pro- 
ducing top-notch work, yet being unable to 
obtain the funding we need to research and 
protect our fossil heritage,’ says a researcher 
at the University of Chile, Alex Vargas, one 
of the team that documented Chilesaurus. 
“The University of Chile has been our main 
support. It recognized our excellence.’ This 
recognition, Vargas says, came through 
direct grants and by help with application 
for larger grants from other funding sources. 


ASTRONOMICAL ADVANTAGE 


A strong focus on astronomy helped boost 
Chile’s physical sciences WFC between 
2012 and 2015. 
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Chilean scientists rally at the Government Palace in Santiago, to protest the government’s poor research 
investment, significantly less than the proportion spent by its neighbours. 


Chile’s National Commission for Scien- 
tific and Technological Research (CONI- 
CYT), a funding agency, is part of the 
Ministry of Education and acts in lieu of 
a science ministry. In the past, the com- 
mission has been responsible for more than 
50% of publicly funded science research in 
Chile; in 2016, it had a budget of US$442 
million.Mario Hamuy, an astronomer 
at the University of Chile and chairman 
of CONICYT, says public investment in 
scientific research overall has more than 
doubled since 2007, and that the budget 
for fundamental research has tripled. Chil- 
ean president, Michelle Bachelet, also gave 
Hamuy the job of creating a science and 
technology ministry that would define and 
implement policies. TH 


SINGAPORE 
2012 WFC CHEMISTRY: 227.80 
2015 WFC CHEMISTRY: 296.04 


Singapore’s research performance has 
improved markedly in life sciences and 
chemistry. The city-state, with a population 
of only 5.5 million, also more than doubled 
its contribution to Earth and environmental 
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science journals between 2012 and 2015, 
albeit off a low base. 

With English as the primary language 
taught in schools, Singaporeans have an 
advantage in international collaborations. 
The country also has a steady pipeline of 
scientists-in-training, about 8,000 doctor- 
ate researchers in 2015. 

It also benefits from long-term planning 
and generous science funding, which is done 
in five-year tranches. In January 2016, Prime 
Minister Lee Hsien Loong announced a new 
Research, Innovation and Enterprise plan 
that sets aside US$13.8 billion for research 
investment to 2020. That's an 18% increase 
over the previous one, giving Singapore the 
world’s highest per capita investment in sci- 
ence and technology. The 2016 plan has 
four pillars: advanced manufacturing and 
engineering; health and biomedical sci- 
ences; urban solutions and sustainability; 
and services and digital economy. 

Singapore has a strong focus on water 
research, which is driven by water scarcity 
on its islands. Scientists, led by teams at 
Nanyang Technology University and the 
National University of Singapore, have spent 
decades working with Singapore's Public 
Utilities Board developing ways to reclaim 
wastewater using membrane technologies 
and ultraviolet disinfection techniques. 


Such systems, known in Singapore collec- 
tively as NEWater, now meet about 30% of 
its water needs. 

The organization leading Singapore's 
public R&D efforts is A*STAR — the 
Agency for Science, Technology and 
Research. A*STAR oversees 18 institutes, 
consortia and centres in disciplines from 
biomedical engineering to genetics to 
microelectronics. Biomedical research 
has been encouraged at organizations 
such as the Singapore Immunology Net- 
work (SIgN). Its researchers coauthored 
a paper published in Nature Communica- 
tions last year describing the regulation 
of gene expression among neutrophils, a 
form of immune cell. A*STAR’s biomedi- 
cal research portfolio has been diversified 
to include research in medical technology, 
food and nutrition and consumer care. TH 


THAILAND 
2012 WFC PHYSICAL SCIENCES: 8.58 
2015 WFC PHYSICAL SCIENCES: 17.06 


Known for its resorts and Buddhist tem- 
ples, Thailand is also a regional centre of 
R&D and high-tech manufacturing. It’s 


the world’s second-largest exporter of 
hard disk drives and a major centre for car 
production. 

Thailand’s contribution to basic science 
is also growing, with its researchers dou- 
bling their output in physical sciences jour- 
nals in the index between 2012 and 2015. A 
major player is the state-backed National 
Science and Technology Development 
Agency (NSTDA). Established in 1991, it 
supports centres of excellence in genetic 
engineering, biotechnology, electronics 
and nanotechnology. 

Thai scientists such as a chemist at Chu- 
lalongkorn University, Patchanita Tha- 
myongkit, have been raising the profile of 
home-grown research. Her work on the 
chemistry behind solar cells that use organic 
compounds, known as porphyrins, helped 
earn her the Abdus Salam International 
Centre for Theoretical Physics prize in 
2013; she also scooped a LOreal- UNESCO 
accolade in 2014 for women in science. 
“Thailand is ambitious about upgrading to 
world-class level,” says Thamyongkit. 

However, the country suffers from a 
paucity of researchers and funding. It had 
only 13 R&D personnel per 10,000 people 
in 2014, according to the National Science 
Technology and Innovation Policy Office 
(STI). Only about 0.5% of Thailand’s GDP is 
devoted to research. The government wants 
to increase that to 1% next year, but is rely- 
ing on the private sector to contribute more 
than two-thirds, about US$2.5 billion. 

“Thailand is focused on cultivating anew 
generation of young scientists by promot- 
ing a number of fields in order to support 
innovation in strategic sectors targeted by 
the government,’ says Kittipong Promwong, 
secretary general of STI. Fields will include 
biotechnology, biomedical devices, robotics 
and the Internet of Things.” TH 


PHYSICAL FOCUS 


Thailand’s sharp rise in its physical sciences 
WFC over four years made it a rising star. 
Last year, researchers had 107 physical 
sciences papers included in the index. 
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TURKEY 


2012 WFC E&E: 4.13 
2015 WFC E&E: 6.80 


To Earth and environmental scientists, 
Turkey has particular appeal. “Turkey has 
everything in terms of geological struc- 
tures,” says geologist, Bora Uzel, whose 
work analysing ancient movements in the 
Anatolian landscape has helped increase 
the country’s contribution to Earth and 
environmental journals in the index. 

The country’s increase in this field has 
been off a very low base, making predic- 
tion for future growth difficult, but Uzel 
says that with strong international inter- 
est and such rich material, it’s an exciting 
time for the field. 

One of the most attractive geological 
elements in Turkey is the North Anato- 
lian fault zone. According to Uzel, this 
active fault running along the boundary 
between the Eurasian and the Anatolian 
Plates is one of the most studied faults on 
earth, garnering as much interest as the 
San Andreas fault in California. 

Broader government efforts have 
helped contribute to the recent increase 
in work published in leading Earth and 
environment journals. Since the 2000s, 
the government has encouraged scien- 
tists to study abroad — in the US and 
Europe predominantly — a policy that 
has brought international exposure and 
collaboration networks. More recently, 
increased regard for high-quality pub- 
lications in awarding funding has given 
impetus for Turkish scientists to publish 
in journals included in the index. 

“Here there are still unsolved geological 
questions so Turkey is one of the hottest 
topics in world geoscience research. Every 
high-level Earth scientist wants to study 
here.” AM 
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WHERE THE MONEY GOES 


Resources for scientific research are spread unevenly throughout countries. Spending priorities 
vary according to budget constraints, historic research strengths and industry demands. 


USING IT WISELY @ China @ Poland 
Weighing a country's R&D spend against the size of its science workforce gives an indication of its research @ India @ Sweden 
efficiency. Of the ten most improved countries in the index between 2012 and 2015, China has the biggest © United Kingdom @ Germany 
research budget and workforce, and contributes the most to the index. The United Kingdom is a strong © Australia @ Saudi Arabia 
performer, but spends less on a smaller workforce than China and Germany. @ Russia @ Brazil 
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BANG FOR BUCK 


When dividing a country's total number of articles in the index (AC) with their R&D spending and research workforce, the United Kingdom is the 
most efficient at maximizing its output per research dollar spent. 


AC per billion $US R&D spend AC per 1,000 researchers 
United Kingdom EE 210 United Kingdom SET 23.14 
Colombia a S 162 Denmark eS 20.19 
Denmark es 157 Sweden es 19.94 
Poland es 125 Germany es 15.15 
Sweden es 11 4 Austria eS 14.36 
Israel EY Poland 10.57 
Czech Republic [i 107 Czech Republic [I 10.02 
Germany es 91 China Ma 2.74 
Austria 88 Russia ma 1.68 


AC data are for 2015. GERD and researcher numbers data are from 2013. Countries with data available were included in analysis. 
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NYU SHANGHAI 
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Xing Tian and Qing Cai at the NYU-ECNU Institute of Brain and Cognitive Science study perception, action, language and their underlying mechanisms. 


CATCH THEM IF YOU CAN 


Ininstitutions everywhere from super -cities to remote desert locations, researchers are setting 
the world of discovery on fire and pushing their institutions up the Nature Index rankings. 


r | WV he performance of research strong- 
holds like the Chinese Academy 
of Sciences and the University of 

Oxford have improved dramatically in the 

past few years. 

As the research world’s equivalent of 
Apple or Google, these well-established 
organizations have big budgets, some of the 
best brains and top-notch equipment. 
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So, what about the new kids on the 
block? As a database that tracks articles 
published in top-tier science journals, the 
Nature Index is well placed to identify the 
world’s research rising stars. 

The top 100 most improved institutions 
in the index between 2012 and 2015 
are ranked by their contribution to 68 
high-quality journals, a metric known as 
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weighted fractional count (WFC). Here 
we profile 25 institutions. Some made 
the grade by improving most in their 
country, and others were highlighted for 
their meteoric rise up the index’s global 
rankings. 

China has so many institutions in the 
latter category, we could have produced a 
whole magazine about them. 


SOOCHOW UNIVERSITY 
CHINA 


2or2WAC: $6.08 
ee 


Founded in 1900 during the Qing dynasty, 
Soochow University is named after its 
birthplace of Suzhou on China's east coast 
— 30 minutes from Shanghai via bullet 
train. The institution has strengthened 
key areas of research over the past decade, 
a strategy that has culminated in the dou- 
bling of its Nature Index score since 2012. 
“The university decided to use its limited 
resources to strengthen a few selected 
departments,” says Soochow materials sci- 
ence professor, Lee Shuit-Tong. 

Research centres that focus on central 
government research priorities, such as 
nanotechnology, biomedicine and energy 
research, have boosted the university’s 
published papers overall, while fuelling 
researchers’ drive to produce higher qual- 
ity research, says Lee. 

“Tt was not until 2008 that the develop- 
ment of Soochow University drastically 
sped up,’ says Lee, the founder of the insti- 
tute of functional nano and soft materials 
(FUNSOM), which opened in 2008 with 
US$22 million from the university. FUN- 
SOM contributes more than 40% of the uni- 
versity’s peer-reviewed articles published in 
high-impact journals, with only 2% of the 
total staff. Lee and a colleague were recog- 
nized by Thomson Reuters in its ‘Highly 
Cited Researchers list of 2015. Their team 
is exploring how cellular imaging can 
be used in cancer therapy and in the devel- 
opment of green-energy devices. $0 


INSTITUTE FOR BASIC SCIENCE 
SOUTH KOREA 


aeWFENO 
2015 WEC:SO.1 


For decades, South Korea has propelled its 
economy by investing in applied research 
— often in tandem with industry — while 
basic science was neglected. The Institute 
for Basic Science (IBS) was intended to 
reverse that trend, becoming Korea's equiv- 
alent of Japan's RIKEN and Germany's Max 
Planck Institutes. 

Founded in 2012, IBS is a new player 
even in a line-up of rising stars. It has 
grown to 26 centres with focuses ranging 
from dark matter to gene editing to gra- 
phene to nanomedicine. Each centre has an 
average annual budget of US$8.5 million, 
and there are plans to establish 50 centres 
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TOP 100 MOST IMPROVED 

Countries are sized by the number of institutions in the top 100 institutions with the highest 
increase in their absolute contribution to the index, a metric known as weighted fractional count 
(WFC), between 2012 and 2015. 


UNITED KINGDOM 
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GERMANY 
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GROWTH SPURT 


Since the Institute for Basic Science in 
South Korea was founded in 2012 its WFC 
has skyrocketed. In 2015, 189 papers were 
included in the index. 
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by 2021. About a third of these will be at a 
headquarters campus to be constructed in 
Daejeon; the rest hosted by universities and 
institutes around the country. 

“The funding is generous,” says Jin-soo 
Kim, who heads the Center for Genome 
Engineering. “I don't have to write a grant” 

South Korean leaders hope IBS will 
deliver on the dream ofa scientific Nobel 
prize. “The phenomenon is aggravated 
by Japanese success,” says IBS president 
Doochul Kim, referring to the 21 scientific 
Nobels won by the country’s rival. “Korean 
people have always compared and tried to 
catch up to Japan. Politicians ask us, “What 
have you done?” 

South Korean president, Park Geun-hye, 
has also argued that IBS has many benefits. 
With Korea's economy slowing, the techno- 
logical rise of China, and renewed compe- 
tition from Japan spurred by a weak yen, 
she is betting on IBS and basic science as a 
catalyst for growth. MZ 


EAST CHINA NORMAL UNIVERSITY 
CHINA 


2012 WFC: 35.55 
2015 WFC: 83.90 


East China Normal University (ECNU) 
evolved from a teacher training university 
to a multi-disciplinary institute and is fast 
gaining a reputation for its world-class 
research. Between 2012 and 2015, the uni- 
versity’s contribution to articles in the index 
has more than doubled, lifting it more than 
200 places in the Global 500 ranking to 
142nd position in 2015. 

The university prides itself on its 
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international collaborations, and its most 
recent partnerships have led to fruitful 
research. In 2011, ECNU and New York 
University joined forces to establish Chi- 
nas first Sino-American university — New 
York University Shanghai (NYU Shang- 
hai). A 2015 study by researchers at NYU 
Shanghai's Institute of Brain and Cognitive 
Science found physical evidence that the 
brain arranges words in a hierarchy using 
a form of ‘internal grammar’ The findings, 
published in Nature Neuroscience, suggest 
the human mind can intuitively configure 
a string of words such as ‘ancient history 
is drinking tea into a structure that has 
meaning, such as ‘drinking tea is ancient 
history: $0 


HUNAN UNIVERSITY 
CHINA 


2012 WFC: 49.22 
2015 WFC: 92.47 


Hunan University is one of China's oldest 
and can trace its roots back to the year 976 
CE. The campus sits on the Xiang River in 
Changsha, the capital city of the province 
of Hunan. 

The university has almost doubled its 
contribution to the index between 2012 
and 2015, a rise attributed to its recruit- 
ment of top-level researchers, according to 
Weihong Tan, a professor of chemistry and 
biomedical engineering. In 2013, Hunan 
established the Institute of Chemical Biol- 
ogy and Nanomedicine under the direction 
of leading US scientists, Chad Mirkin from 
Northwestern University and David Walt 
from Tufts University. 

Tan's team are focusing on research in 
biomedicine, which he hopes will lead to 
new industries for the province. $0 


SICHUAN UNIVERSITY 
CHINA 


2012 WFC: 44.88 
2015 WFC: 83.22 


Sichuan University’s position in the Nature 
Index has soared in recent years, climbing 
from 278th place in the Global 500 insti- 
tutions ranking in 2012 to 145th in 2015. 
New facilities, well-funded salary start-up 
packages and accessible research grants 
have attracted eager young researchers to 
the university in Chengdu in southwest 
China. Fifty international researchers have 
been recruited in the past two years to the 
university's National Key Laboratory of 
Biotherapy, which opened in 2005. The 
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multidisciplinary research centre enables 
discovery and development of innovative 
drug candidates. 

“Basic research, preclinical development, 
translational and clinical medicine are 
seamlessly integrated,’ says Wei Yuquan, 
who is director of the laboratory as well as 
the university's vice president. 

He says: “We want to bring disciplines 
together to try to solve the major problems 
in medicine, science and technology.’ $0 


KING ABDULLAH UNIVERSITY OF 


SCIENCE & TECHNOLOGY 
SAUDI ARABIA 


2012 WFC: 40.51 
2015 WFC: 72.19 


KAUST is more than just a university — it’s 
practically a city. Occupying 36 square kilo- 
metres on the Red Sea, KAUST is separated 
from conservative Saudi society. Courses 
are coeducational, women can drive and 
are not required to wear veils, and the cam- 
pus has one of only two sanctioned movie 
theatres in the kingdom. 

The graduate-only research university 
was founded in 2009 at the behest of the 
late King Abdullah. He hoped to create a 
premier institute to kickstart Saudi Arabia's 
research scene — and to help develop clean 
energy technology 
to wean it off oil. 


“We want : 
to brin g He left the univer- 
disciplines sity with an endow- 
tove ie to ment estimated at 
: ; US$20 billion, and 
FONE GY “ it recruited foreign 
bl 8 
ed ems a researchers with 
science an 


start-up funds, 
reportedly worth 
up to US$1 million 
each, to set up labs. In the early days many 
faculty members left, claiming the univer- 
sity had failed to deliver on its promises. 
But the institution’s contribution to high- 
quality research had surged by 2015 to lev- 
els 80% higher than in 2012. MZ 


medicine.” 


NANYANG TECHNOLOGICAL 
UNIVERSITY 

SINGAPORE 

2012 WFC: 177.60 


2015 WFC: 207.83 


Nanyang Technological University is one 
of Asia’s research powerhouses, ranked 
37th in the index’s Global 500. Most of 
the Singaporean university's high-quality 
research output is in chemistry, but it has 
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KAUST’s Center for Desert Agriculture works toward food security solutions in arid climates. 
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The upward 
trajectory of many 
of China’s less 
well-known research 
universities is 
reflected in the WFC 
growth of four 
Chinese universities 
in the top 100. 
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also invested heavily in sustainability and 
environmental research. 

Founded in 1991 through a merger 
between an engineering university and 
the national institute for teacher training, 
it was granted autonomy from the state 
in 2006 and swiftly transformed into a 
research-intensive institute, taking advan- 
tage of increased public funds. MZ 


INDIAN INSTITUTE OF SCIENCE 


EDUCATION & RESEARCH 
INDIA 


2012 WFC: 50.97 
2015 WFC: 78.67 


The Indian Institute of Science Education 
and Research (IISER), in Pune, was only 
founded in 2006. A decade later, the Indian 
government has established six additional 
IISERs, in Kolkata, Mohali, Bhopal, Thiru- 
vanthapuram, Tirupati and, most recently, 
in Berhampur. Collectively, they have 
risen to become the fourth-ranked Indian 
organization in the Global 500 by boosting 
their contribution to high-quality science 
by more than 50% since 2012. 

The IISERs were set up to build on the 
success of India’s highest entrant to the 
Global 500, the Indian Institutes of Tech- 
nology, says Haripriya Rangan from the 
Australia India Institute in Melbourne. 
“We have a very strong reputation over- 
seas, but in India there has been not much 
reflection of that,’ she says. 

The IISERs, with an emphasis on 
fundamental scientific research, might 
address that. “They have been well 
funded, with state-of-the-art facilities,” 
says Kartik Shanker, an ecologist at the 
IIT in Bangalore. “They have hired good 
young, faculty with very active research 
programmes.” 

But, the IISERs have their critics. The 
institutions were set up to span the funda- 
mental sciences, and yet there has been a 
perceived bias toward chemistry, sources 
say. The Scientific Advisory Council to 
the Prime Minis- 
ter, which recom- 


“We have 
mended the IISERs 
astrong : 

” be established, is 
reputation headed 
overseas, but ois ie uaeae 
ss Foils th inent chemist, and 
da ne ag chemists were 
isnot eae appointed as direc- 
reflection tors to four of the 
of that. first five IISERs to 


open. In the index, 
chemistry research accounts for almost 
80% of the organizations’ contribution to 
world-class natural science research. 
JMC 
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ISRAEL INSTITUTE OF TECHNOLOGY 


(THE TECHNION) 
ISRAEL 


2012 WFC: 68.75 
2015 WFC: 95.59 


It’s difficult to pinpoint the factors 
that have pushed the Technion up the 

Nature Index Global 500 ranking of 
top-performing research organisations, 
according to Ehud Keinan, a chemist at the 
institution and, since 2009, the president of 
the Israel Chemical Society. “If the recipe 
was easy and clear, everybody would do it” 
But the Technion’s chemists have certainly 
got some of the reagents right. 

Established in the port city of Haifa in 
1912, Israel’s oldest university has 18 aca- 
demic departments, including all scientific 
areas from engineering to medicine. But, of 
the university's 565 academic faculty, it’s pri- 
marily the chemists that have powered the 
Technion’s rise up the index rankings, from 
position 179 in 2012 to 108 in 2015. They 
have more than doubled their contribution 
to chemistry papers published in top-tier 
journals since 2012, the first year the index 
was compiled. 

The Technion’s overall contribution to 
high-quality science papers jumped almost 
40% in that time. 

Keinan identifies one probable reason for 
the institute's performance: the awarding of 
the 2004 Nobel Prize in chemistry to Tech- 
nion researchers Aaron Ciechanover and 
Avram Hershko for their discovery of how 
the body tags proteins for recycling. This 
success was followed in 2011 by Technion 
chemist Dan Shechtman who won the same 
gong for his discovery of quasicrystals, an 
unexpected form of matter that combines 
order and disorder. 

“Success is an autocatalytic process,” 
Keinan says. “When you get good faculty, 
you can attract even better faculty members, 
and good graduate students.” 

Keinan is now working to establish a 
branch of the Technion in Guangdong, China, 
to leverage its strength in chemistry. JMC 


BAYLOR COLLEGE OF MEDICINE 
UNITED STATES 


2012 WFC: 66.29 
2015 WFC: 92.15 


Baylor College of Medicine (BCM) is the 
self-proclaimed “intellectual heart’ of the 
world’s largest medical complex, Houston's 
Texas Medical Centre. 

Despite its enormity now, BCM was 
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The neurosurgery department at Baylor College of Medicine, whose other areas of strength are 
paediatrics, immunology and the study of cardiovascular disease. 


small for much of its 116-year history. A 
key figure in setting the college on its cur- 
rent trajectory was a paediatrician called 
Ralph Feigin. 

“Feigin was an impressive dude and a 
very successful leader,’ says Scott Simon, a 
former faculty member, now at the Univer- 
sity of California, Davis. Feigin led the pae- 
diatrics department from 1977 to 2008, and 
was Baylor’s president and CEO from 1996 
to 2003. “I’ve never seen a physician who 
was as good an administrator,’ Simon adds. 

Baylor has grown by building on existing 
strengths, says Simon — including paedi- 
atrics, immunology and cardiovascular 
disease. “They went for three or four areas 
they did really well at, and pulled in the best 
people — and then other programmes grew 
around it” 

In 2009 and 2010, BCM considered 
merging with neighbouring Rice University 
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or with Baylor University in Dallas, from 
which it had split in 1969 — but decided 
to stay solo. Remaining independent hasn't 
hurt its output: BCM’s contribution to 
high-quality research rose 39% from 2012 
to 2015. JMC, MZ 


NORTH CAROLINA STATE UNIVERSITY 
UNITED STATES 


2012 WFC: 51.71 
2015 WFC: 76.29 


Located in Raleigh, North Carolina State 
University is at the heart of a region quickly 
becoming a major research and technology 
hub. In 2015, Forbes magazine reported it 
had the highest rate of STEM job growth in 
the US for the preceding decade, topping 


Silicon Valley. NCSU is one of the three 
academic anchors of the so-called Research 
Triangle, along with Duke University in 
Durham and the University of North Car- 
olina in Chapel Hill. More than 100 start- 
ups or spinoffs have emerged from North 
Carolina State, attracting $US1.6 billion in 
venture capital. 

Both its total number of articles in the 
index and its contribution to those arti- 
cles spiked by roughly 50% between 2012 
and 2015, vaulting 91 places in the index’s 
Global 500 to rank 164. Traditionally 
strong in engineering, it hosts two United 
States National Science Foundation (NSF) 
Engineering Research Centers: one that 
focuses on smart electrical grids and one 
specializing on health wearables. 

In the mid-2000s, North Carolina State 
become mired in controversy by claims 
that three of its scientists had falsified data 


in a 2004 study finding that RNA could 
regulate the growth of the metal palladium. 
A university investigation in 2008 found 
the paper, which had been published in 
Science, included false data, but stopped 
short of finding intentional misconduct. 
A more thorough 2013 investigation by 
the NSF inspector general concluded the 
pair had indeed “recklessly falsified data,” 
although the NSF eventually ruled it had not 
been intentional. Science retracted the paper 
in February 2016. The scientists in question 
no longer work at the university. MZ 


XV'AN JIAOTONG UNIVERSITY 
CHINA 


2012 WFC: 30.27 
2015 WFC: 50.78 


In the past four years, Xi'an Jiaotong Uni- 
versity, in the ancient former capital city of 
Xian in central China, increased its high- 
quality natural science in all subject areas, 
but most dramatically in chemistry and the 
physical sciences. 

The university is home to the State 
Key Laboratory for Mechanical Behav- 
ior of Materials. State key laboratories are 
centrally-funded institutions that are 
designed to funnel resources to govern- 
ment priority areas. Materials science and 
its industrial applications are considered 
key to China’s economic development. 

The team at Xi'an Jiaotong are explor- 
ing ways in which materials behave at the 
nano scale. They recently developed a 
nanospring that can efficiently store and 
release mechanical energy, a component 
that could be useful in more complex 
nanomechanical systems. 

Four years ago, Xian Jiaotong estab- 
lished a multidisciplinary research insti- 
tute called the Frontier Institute of Science 
and Technology (FIST) which now has 11 
centres focusing on life sciences, medicine 
and mathematics, and has recruited 40 
international scientists. Among them are 
Timothy Bliss and Graham Collingridge 
from the UK, who in March 2016 were 
awarded the European ‘brain prize’ for their 
work on memory mechanisms. $0 


HARBIN INSTITUTE OF TECHNOLOGY 
CHINA 


2012 WFC: 19.30 
2015 WFC: 39.73 


In the capital of the far northern province 
of Heilongjiang, the Harbin Institute of 
Technology (HIT) is synonymous with 
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“China’s research sup- 
goals include porting the 
establishing a country's space 
space station SEER Ce EO 
and sending gramme. China's 
astronauts to 8° oe — 

establishing a 
Mars.” 


space station, 

sending astro- 
nauts to Mars, and industrializing rocket 
technology to more efficiently transport 
vessels into space. 

Over the past decade, Harbin’s research- 
ers have worked on China’s Shenzhou 
spacecraft programme, including its first 
manned spacecraft, Shenzhou 5, and more 
recently, a satellite for emergency data 
monitoring and imaging, Kuaizhou-l, 
which was launched in 2013. 

The university, established in 1920, is a 
member of China’s elite C9 alliance — nine 
institutes in China that receive 10% of the 
country’s research and development fund- 
ing, and are relied on by the central govern- 
ment to produce world-class results. 

HIT has experienced rapid growth in 
the index, doubling its contribution to top 
journals in four years and moving up 208 
places in the Global 500 to reach 304th 
position in 2015. $0 


UNIVERSITY OF REGENSBURG 
GERMANY 


2012 WFC: 56.86 
2015 WFC: 72.58 


The University of Regensburg’s improve- 
ment in the index is driven mostly by 
chemistry, with the university placing 11th 
among the country’s academic chemistry 
institutions in 2015. 

As head of the chemistry department, 
Oliver Reiser points out, “that puts Regens- 
burg above many universities a lot bigger 
than us.” The department has a mere 18 
tenured academics, compared to more 
than 25 at the University of Stuttgart 
(ranked 13th) and more than 40 at the Ruhi 
University Bochum (19th). 

In 2010, Regensburg, in south-east 
Germany, won special status from the 
national funding agency for its studies 
of photocatalysis — chemical reactions 
accelerated by sustainable sunlight — 
which secured the department additional 
funding until 2019. 

Regensburg’s physics department hosts 
a similar centre researching carbon-based 
nanostructures. Overall, the univer- 
sity’s contribution to index papers grew 
by almost 28%, the largest surge by any 
German institution in the top 100 most 
improved institutions. JMC 
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ITALIAN INSTITUTE OF TECHNOLOGY 
ITALY 


2012 WFC: 22.42 
2015 WFC: 41.39 


When the Italian government decided to 
establish a research organization in 2004, 
it sought inspiration from the world- 
renowned research organizations of Ger- 
many, in particular the Max Planck Society. 

The Max Planck Society is funded by the 
state, but operates as a private foundation. 
“They are a little bit more agile than uni- 
versities, says Roberto Cingolani, scientific 
director at the Italian Institute of Technol- 
ogy, which opened its main laboratory in 
Genoa in 2009. 

The institute’s 1,300 research staff are 
spread across 11 labs around the country 
and two small units at premier US uni- 
versities, Harvard and the Massachusetts 
Institute of Technology. Since 2012, the 
institute’s scientists increased their contri- 
bution to index papers more than any other 
Italian institution. 

The institute concentrates on nanotech- 
nology, robotics and brain science. The 
work is more connected than it might first 
sound. “Our general mission is ‘translating 
evolution into technology,’ Cingolani says. 
The institute’s best-known piece of research 
is its humanoid research robot, iCub, devel- 
oped to test the idea that a robot, like an 
infant human, can develop an understand- 
ing of the world around it via physical inter- 
action with its environment. JMC 


UNIVERSITY OF SUSSEX 
UNITED KINGDOM 


2012 WFC: 18.09 
2015 WFC: 33.26 


There's no great mystery behind the rapid 
rise of the University of Sussex in the index. 
In 2012, the university didn’t place in the first 
Global 500 ranking. But that year, it began 
strategic financial investment in key disci- 
plines, in particular, physics and astronomy. 
The number of research staff has increased 
by more than 60%, from 23 in 2012/13 to 40 
today and is still growing, says astrophysicist, 
Peter Coles, who leads the school of math- 
ematical and physical sciences. 

Recruits were brought in to reinforce 
existing strengths in astronomy and particle 
physics, and to establish programmes in 
quantum technologies and condensed 
matter physics. 

Mainly thanks to a quadrupling of its 
physical sciences score, the university 
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The Italian Institute of Technology’s iCub was built to test whether a robot can learn through 
physical interaction. 
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reached 351 in the Global 500 in 2015. That 
represents an 83.9% rise in its contribution 
to index papers since 2012 — the biggest 
jump of any UK research organisation in 
the top 100 most improved institutions. JMC 


PERIMETER INSTITUTE FOR 


THEORETICAL PHYSICS 
CANADA 


2012 WFC: 32.26 
2015 WFC: 46.54 


When Canadian prime minister, Justin 
Trudeau, visited the Perimeter Institute for 
Theoretical Physics in Waterloo, Ontario in 
April 2016, he wowed reporters by nailing 
a 30-second explanation of quantum 
computing that instantly went viral. But, 
the telegenic PM’s visit didn’t just burnish 
his image. He announced the Ontario 
government would contribute CDN$50 
million (US$38 million) to its budget over 
five years. 

That’s only one funding avenue this 
unique public-private institute attracts. 
Since its founding in 2000, it has lured 
funding from foundations, corporate 
sponsors, and philanthropists — including 
CND$150 million from its founder, 
Mike Lazaridis, also the co-founder of 
BlackBerry. The institute now has 25 full- 
time faculty and counts famed theoretical 
physicists Stephen Hawking and Leonard 
Susskind among its 160 scientists-in- 
residence. The institute’s contribution to 
index papers increased by 44% between 
2012 and 2015, pushing it from position 
362 in the index’s Global 500 ranking in 
2012 to position 267 in 2015. MZ 


JAGIELLONIAN UNIVERSITY 
POLAND 


2012 WFC: 17.37 
2015 WFC: 31 


The standing of Jagiellonian University in 
Krakow, Poland, has fluctuated since its 
foundation in 1364, often buffeted by shifts 
in the balance of power in central Europe. 
But, the oldest university in Poland is also 
the country’s rising star, having increased its 
contribution to high-quality research papers 
by 79%, the largest increase of any Polish 
institution in the top 100 most improved 
institutions. 

In 2010, Jagiellonian was awarded a Euro- 
pean Union regional development grant 
to host Poland’s first synchrotron. Com- 
pleted in late 2015, the synchrotron has 


yet to contribute directly to the university's 
performance in the index. However, the 
synchrotron was integral to Jagiellonian’s 
successful 2012 bid, with partner research 
organizations in Krakow, to become 
Poland's flagship research centre for phys- 
ics and chemistry — a status that guaran- 
tees additional government funding. The 
research financed by this grant is already 
helping boost Jagiellonian up the Global 500 
in which it ranked 370 in 2015. JMC 


FRENCH NATIONAL INSTITUTE FOR 


HEALTH AND MEDICAL RESEARCH 
FRANCE 


2012 WFC: 72.95 
2015 WFC: 85.55 


The French government shook up its sci- 
ence funding in 2009, introducing a €47 
billion ‘Investments for the Future’ fund to 
boost national competitiveness in targeted 
sectors. Some €5 billion of the fund was 
allocated to biomedical research, according 
to Yves Lévy, CEO of the French National 
Institute for Health and Medical Research 
(INSERM). “Given INSERM’s focus, we 
were very well placed and have won many 
of these grants,” Lévy says. All INSERM’s 
researchers work in research units embed- 
ded in French universities, most of them in 
university hospitals. 

Since 2012, INSERMS’s contribution to 
the index grew the most of any French insti- 
tution in the top 100 most improved list, 
and it placed 136 in the Global 500 ranking 
in 2015. But as Lévy notes, some of its high- 
est impact research is published in medical 
journals such as the New England Journal 
of Medicine, which are not included in the 
index. JMC 


UNIVERSITY OF COPENHAGEN 
DENMARK 


2012 WFC: 112.63 
2015 WFC: 125.32 


The oldest and most prestigious university 
in Denmark, the University of Copenhagen 
ranked 72nd globally in the index in 2015, 
with strengths in life and physical sciences. 
The university's researchers published 25% 
more articles in top natural science journals 
in the index in 2015 than in 2012. 

The rise came despite declines in state 
funding, which the university sought to 
offset with external sources, including six 
major laureate grants from the Novo Nor- 
disk Foundation between 2012 and 2014 
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totalling US$36 million. 

But, the funding cuts are now a major 
crisis that could jeopardize the university's 
gains: In February 2016, due to budget con- 
straints, the university announced it would 
lay off 7% of staff — 500 people, including 
more than 100 researchers — and impose a 
hiring freeze. MZ 


SAO PAULO STATE UNIVERSITY 
BRAZIL 


2012 WFC: 11.21 
2015 WFC: 23.52 


Sao Paulo State University (UNESP) is fairly 
new to Brazil's burgeoning research scene, 
but it arrived with intent. Founded in 1976 
by a merger of previously independent 
institutions, this multi-campus university 
is modelled on the University of Califor- 
nia’s state-wide system and consists of 24 
campuses across Sao Paulo State, including 
its eponymous capital. In 2009, it embarked 
ona hiring frenzy of more than 1,000 faculty 
and researchers, and from 2012 to 2015, its 
contribution to high-quality science in the 
index more than doubled. 

The university’s research strengths are 
in physics and astronomy. Its Institute for 
Theoretical Physics was founded in 1951, 
predating the university itself. It also hosts 
the ICTP South American Institute for 
Fundamental Research, an offshoot of the 
International Centre for Theoretical Physics 
in Trieste, Italy. 

However, Brazil's economic crisis has 
put science funding on the chopping block, 
cutting it by 25% in 2015. After the May 
impeachment of President Dilma Rousseff, 
the science ministry was merged with the 
telecommunications ministry, leaving many 
researchers wondering if science will remain 
downgraded for many years to come. MZ 


UNIVERSITY OF BASEL 
SWITZERLAND 


2012 WFC: 80.50 
2015 WFC: 92.77 


Switzerland's city of Basel is a hub for life 
sciences research and home to the head- 
quarters of pharmaceutical giants, Novartis 
and Roche. In 2007, the University of Basel 
decided to play to this strength and make 
life science its key focus. 

This concentration of effort has helped 
power the university's 40-place leap up the 
Global 500 since 2012, to 109 in 2015. 

The university's strategy was supported 
by the regional government, says Christian 
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Sengstag, deputy vice-rector for research at 
the university. “Lots of resources have been 
pumped into life science research,” he says. 
New research groups have been established, 
and the university has invested “in top-notch 
infrastructure and buildings.’ Last year Basel 
scientists used single-molecule spectroscopy 
to show how E. coli bacteria fold the proteins 
that form their outer membrane — a long- 
standing drug discovery target. 

Political support can be fickle, however. 
Federal elections held in 2015 saw a strong 
swing toward the right-wing SVP party, 
which has vowed to cut government spend- 
ing. “In the future we will probably have to 
live on a lower budget,’ says Sengstag. JMC 


SAINT PETERSBURG STATE 


UNIVERSITY 
RUSSIA 


2012 WFC: 13.97 
2015 WFC: 25.36 


Saint Petersburg State University is known 
for educating generations of Russia’ elite: 
it counts current president Vladimir Putin 
and prime minister Dmitry Medvedev 
among its alumni. 

After the fall of the USSR in 1991, fund- 
ing plummeted and national science output 
slipped. But, budgets are slowly building 
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Chris Spencer of Curtin University at the basalt outcrop of the Semail Ophiolite in Wadi Jizzi, Oman. 


up, and the country’s oldest university leapt 
into the Global 500 in the index in 2015, 
at position 410, by more than doubling its 
contribution to high-quality science over 
the previous four years. 

Part of its upswing has come from 
increased state funds, including seven pro- 
jects funded by the Russian education and 
science ministry’s so-called ‘mega-grants. 
These grants of 90-150 million roubles (US 
$1.3-2.2 million each) are open to scientists 
around the globe, and have played a key role 
in the nation’s strategy to attract foreign tal- 
ent and revitalize its research scene. Three 
of the university’s mega-grant projects 
involve nanomaterials, reflecting a tradi- 
tional strength in chemistry and physical 
sciences, but others focus on the geomor- 
phology of the polar regions, genomics and 
bioinformatics. MZ 


UMEA UNIVERSITY 
SWEDEN 


2012 WFC: 19.64 
2015 WFC: 30.84 


Sweden's temperate environment is a world 
away from tropical Latin America, where 
the Zika virus continues to wreak havoc, but 
researchers at the country’s Umea University 
have joined the fight to stop Zika spreading 
further. Global health is one of Umea’s key 
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strengths, and that includes an EU-funded 
consortium, established at the university in 
2011 to fight Dengue fever. (Zika and Den- 
gue are transmitted by some of the same spe- 
cies of mosquito.) 

Umea University’s contribution to high- 
quality science increased by 57% from 2012 
to 2015, placing it in the top 400 institutions 
globally. Founded in 1965, the university pri- 
oritizes biotechnology and life sciences, and 
its regional forests in particular have yielded 
plenty of research. In 2013, The Umea Plant 
Science Center, a collaboration between 
Umea University and the Swedish Univer- 
sity of Agricultural Sciences, mapped the 
Norway spruce genome, which comprises 
20 billion base pairs — more than six times 
as many as the human genome. Its energy 
research portfolio also includes developing 
biofuels from trees. MZ 


CURTIN UNIVERSITY 
AUSTRALIA 


2012 WFC: 13.23 
2015 WFC: 24.16 


The state of Western Australia has two dis- 
tinguishing qualities: it is rich in mineral 
resources and its interior is sparsely popu- 
lated. Both factors feed into Curtin Univer- 
sity’s focus on Earth sciences. 

A decade ago, the university, located in 
Perth, reorganized its mining school to 
incorporate key aspects of applied geol- 
ogy, says Graeme Wright, Curtin’s deputy 
vice-chancellor for research. Wright cites 
this transformation as the key to the 54% 
increase in Curtin’s contribution to articles 
in Earth and environmental sciences in 
the index since 2012. Overall, the univer- 
sity’s contribution to journals in the index 
increased more than 80%. 

Western Australia’s vast, people-free inte- 
rior also makes it among the most radio- 
quiet places on the planet and an ideal site 
for the Square Kilometre Array (SKA), an 
internationally funded next-generation 
radio telescope, scheduled to be fully oper- 
ational in 2030. The SKA will peer deeper 
into the universe than ever before and help 
answer questions about how stars, galax- 
ies and the universe formed and evolved. 
The state's ‘pathfinder’ radio telescopes are 
already boosting Curtin’s physical sciences 
index score. “From small beginnings, we 
built a strong radioastronomy group,’ says 
Wright. 

For Wright, the key to Curtin’s index suc- 
cess is its collaborations. “We are a mod- 
estly-sized university. We know we've got 
great people, but that there are even better 
people outside. That’s key to what we do.” 
JMC 
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A GUIDE TO THE NATURE INDEX 


A description of the terminology and methodology used in this supplement, 
and a guide to the functionality available free online at natureindex.com. 


he Nature Index is a database of author 

affiliations and institutional relation- 

ships. The index tracks contributions 

to articles published in a group of highly selec- 

tive science journals, chosen by an independ- 
ent group of active researchers. 

The Nature Index provides absolute counts 
of publication productivity at the institutional 
and national level and, as such, is one indica- 
tor of global high-quality research output. 

Data in the Nature Index are updated 
monthly, with the most recent 12 months of 
data made available under a Creative Com- 
mons licence at natureindex.com. 

The database is compiled by Nature Pub- 
lishing Group (NPG). The list of journals 
tracked by the Nature Index is under review, 
and from 2016 will be extended to include the 
clinical sciences. 


NATURE INDEX METRICS 

There are four measures provided by the Nature 
Index to track affiliation data. The simplest is 
the article count (AC). A country or institution 
is given an AC of 1 for each article that has at 
least one author from that country or institu- 
tion. This is the case whether an article has one 
or a hundred authors, and it means that the 
same article can contribute to the AC of multi- 
ple countries or institutions. 

To get a sense of a country or institution's 
contribution to an article, and to remove the 
possibility of counting articles more than once, 
the Nature Index uses the fractional count (FC), 
which takes into account the relative contri- 
bution of each author to an article. The total 
FC available per paper is 1, which is shared 
between all authors under the assumption that 
each contributed equally. For instance, a paper 
with 10 authors means that each author receives 
an FC of 0.1. For authors who have joint affili- 
ations, the individual FC is then split equally 
between each affiliation. 

The third measure used is the weighted frac- 
tional count (WFC), which applies a weighting 
to the FC to adjust for the over-representation of 
papers in astronomy and astrophysics. The four 
journals in these disciplines publish about 50% 
of all papers in international journals in this 
field — approximately five times the equivalent 
percentage for other fields. Therefore, although 
the data for astronomy and astrophysics are 
compiled in the same way as for all other disci- 
plines, articles from these journals are assigned 
one-fifth the weight of other articles (i.e., the FC 
is multiplied by 0.2 to derive the WFC). 

The total FC or WFC for an institution is 
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NATUREINDEX.COM 


A global indicator of high-quality research 


natureindex.com 
users can search for 
specific institutions 
or countries and 
generate their own 
reports, ordered by 
article count (AC), 
fractional count (FC) 
or weighted fractional 
count (WFC). 

Each query will 
return a profile page 
that lists the country 
or institution’s recent 
research outputs, 
from which it is 
possible to drill down 
for more information. 
For example, articles 
can be displayed by 
journal, and then by 
article title. As in the 
supplement, research 
outputs are organized 


Home 


Home Institution outputs 


Institution name 
Country 


Region: Global 
Subject/journal group: All 


Below, the same research outputs are grouped by 
subject. Click on the subject to drill-down into a list 
of articles organized by journal, and then by title. 


subject area. 


The table to the right includes counts of all 
research outputs for Institution name published 
between 1 January 2014 - 31 December 2014 
which are tracked by the Nature Index. 


Note: Articles may be assigned to more than one 


nature INDEX 


Institution outputs 


Country outputs Customer support FAQ 


Institution name 


Collaboration Relationships 


1 January 2014 - 31 December 2014 


WFC 


1221 598.04 558.30 


Outputs by subject 


by subject area. The Subject AC FC WFC 
profile page also BE Chemistry 276 «179.1 = 179.14 
lists the institution |_| Earth & Environmental Sciences 95 42.73 42.73 


or country’s top 
collaborators, as well 
as its relationship 
with other research 
organizations. 


|_| Life Sciences 


| | Physical Sciences 


calculated by summing the FC or WEC for 
individual authors. 

The process is similar for countries, 
although complicated by the fact that some 
institutions have overseas labs that will be 
counted towards their host country totals. 
What's more, there is great variability in the 
way authors present their affiliations. 

Every effort is made to count affiliations 
consistently, with a background of reasonable 
assumptions. 

The fourth metric is collaboration score. 
This is a relatively new metric that is derived 
by taking the average of the FCs for all the 
bilateral relationships for that institution 
or country. 

If institution A has relationships with two 


© 2016 Macmillan Publishers Limited. All rights reserved. 


< Return to institution outputs 


439 231.50 231.50 


652 284.48 244.74 


others, B and C, then the collaboration score 
is average of the FC for A+ Band A+C. 

For more information on how the affiliation 
information is processed and counted, please 
see the FAQ section at natureindex.com. 


THE SUPPLEMENT 

Nature Index 2016 Rising Stars is based on 
data from the Nature Index, covering articles 
published during four consecutive years from 
1 January 2012 to 31 December 2015. 

Most analyses within the supplement 
use WFC as the primary metric, as it provides 
a more even basis for comparison across 
multiple disciplines, and in determining 
the relative contribution of each country or 
institution. m 


